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AGTCCGGGGG ACCTTTTTAG TCGGTAGATT GAGATTGCAA ACAAATCTGC ATCTACATTG 48 60 

AAAGCTTAAT TTCTAATAAT TGAAAAAATC GAATGAAAAA TTTCTTACCT TCATTCACAG 4920 

AGCTCGATTT CAGAGCTCTT TTTGCTAGCT TAT T CAT ACT TTTCTGAATT TCGAAAAAGA 4 980 

AATGTAAGCG T T T GAT AG AT TTACAAAAAG ATTGTATAAT AGGGATAAGA ATAGAAAAGG 504 0 

AGAAGTCTCA TGGCAGTTAA AGATTTTATG ACCCGCAAGG TAGTTTATAT TAGTCCAGAT 5100 

ATAACAGTAT CTCATGCAGC AGATTTGATG AGAGAGCAAG GTTTGCACCG TCTGCCTGTT 5160 

AT CG AAAATG ATCAATTAGT TGGTTTGGTG ACTGAGGGAA CCATTGCACA AGCAAGTCCA 52 2 0 

TCTAAAGCAA CAAGTCTTTC TATCTATGAG ATGAATTATC TTCTGAATAA GACAAAAGTA 52 80 

AAAGATGTCA TGATTCGCGA TGTTGTCACT GTCTCAGGCT ATGCTAGTCT AGAAGATGCA 534 0 

ACTTATCTGA TGTTGAAAAA TAAGATTAGT ATTCTCCCTG TCGTAGATAA CCATCAAGTA 5400 

TACGGAGTTA T TAC T G AC CG TGACGTTTTC CAAGCCTTTC TTGAAATTGC AGGTTATGGC 54 6 0 

GAAGAAGGGA TTCGTGTACG CTTTGTTACA GAAGATGAAG TTGGTGTTCT TGGAAAAATT 552 0 

GTTTCTTTGA TTGTAGAAGA AAATTTGAAT ATCTCCCATA CAGTCAATAT TCCGCGTAAG 55 8 0 

GATGGTAAGG TGATTATCGA AGTGCAAATC GATGGATCAA TTGATTTACC AGCCTTGAAA 564 0 

GAAAAATTTG AAGCAAATGG TATTCAAGTG GAAGAAATCG CTCGCACTTC AG C AAAAGT C 57 0 0 

TTGTAAGAAG GGAAGCCCAA AGGCTTCTTT TTTCATGAAA AGGGGATTAG AGCAAAAGAT 57 6 0 

GGAAAGAAAT GATAAAATAT GCTATAATGA AATAATGTAA AAAAGGAGTA TTTATGGACA 582 0 

TTTCAGTAAT TCGTCAGAAA ATTGACGCAA ATCGTGAAAA ATTAGCTTCT TTCAGGGGGT 5880 

CTCTTTGACC TCGAAGGGCT AGAGGAAGAG ATTGCCATCT TGGAAAACAA GATGACAGAA 594 0 

CCTGATTTTT GGAACGATAA TATTGCGGCC CAAAAAACGT CGCAAGAATT AAATGAATTA 600 0 

AAAAACACTT ACAAT AC CTT CCATAAGATG GAAGAGTTGC AGGATGAAGT CGAAATTTTA 6060 

TTGGATTTTT TGGCTGAAGA CGAGTCAGTG CATGATGAAC TGGTAGCGCA GTTAGCCGAA 612 0 

CTTGATAAGA T AATG AC C AG CTACGAGATG ACTCTACTCT TGTCAGAACC TTATGACCAC 6180 

AACAATGCCA TCTTGGAAAT CCATCCAGGT TCTGGTGGTA CTGAGGCGCA GGACTGGGGT 62 4 0 

GATATGTTGC TTCGTATGTA T AC TCGTT AT GGTAATGCTA AAGGCTTTAA AGTGGAAGTG 63 00 

TTGGATTACC AAGCAGGTGA TGAGGCTGGT ATTAAGTCGG TAACTTTATC ATTTGAAGGG 63 60 

CCTAATGCCT ATGGTCTCCT CAAGTCAGAA ATGGGTGTTC ACCGCTTAGT GCGAATCTCA 6420 

CCATTTGACT CTGCCAAACG TCGCCATACC TCTTTCACAT CTGTAGAAGT GATGCCAGAA 6480 

TTGGATGATA CTATTGAAGT GGAAATCCGT GAAGATGATA TCAAGATGGA TACCTTCCGT 654 0 
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TCAGGTGGTG 


CCGGTGGACA 


AAACGTCAAT 


AAGGTTTCAA 


CAGGTGTACG 


TTTAACCCAC 


6600 


ATTCCAACTG 


GAATTGTTGT 


CCAATCAACA 


GTAGATCGTA 


CCCAGTATGG 


AAATAGAGAT 


6660 


CGTGCCATGA 


AGATGTTGCA 


GGCTAAGCTC 


TATCAAATGG 


AGCAAGATAA 


GAAGGCTGCG 


6720 


GAGGTAGATT 


CTCTCAAAGG 


TGAGAAAAAG 


GAG AT C AC T T 


GGGGAAGCCA 


AATCCGTTCT 


6780 


TATGTCTTCA 


CGCCTTATAC 


TATGGTAAAA 


GATCACCGAA 


CTAGCTTTGA 


GGTTGCTCAG 


6840 


GTAGATAAGG 


TTATGGATGG 


GGACCTAGAT 


GGTTTTATCG 


ATGCTTATCT 


CAAGTGGCGA 


6900 


ATTAGCTAAG 


ATAGAAAGGA 


ACT C AC ATGT 


CAATTATTGA 


AAT GAG AG AT 


GTCGTTAAAA 


6960 


AATACGACAA 


CGGAACAACT 


GCTCTACGCG 


GTGTTTCGGT 


TAGCGTTCAA 


CCGGGGGAAT 


7020 


TTGCTTACAT 


CGTAGGACCT 


TCAGGAGCAG 


GGAAGTCAAC 


TTTTATTCGT 


TCTCTGTATC 


7080 


GTGAAGTAAA 


AATCGATAAA 


GGAAGCCTAT 


CAGTTGCTGG 


TTTTAATCTG 


GTTAAGATCA 


7140 


AAAAGAAAGA 


TGTCCCGCTT 


CTACGTCGTA 


GTGTTGGGGT 


TGTCTTCCAG 


GAT T AT AAAT 


7200 


TGTTACCAAA 


GAAAACTGTC 


TATGAAAATA 


TTGCTTACGC 


TATGGAAGTA 


ATCGGGGAAA 


7260 


ATCGCCGTAA 


TATCAAAAGA 


CGAGTGATGG 


AAGTTTTGGA 


CTTGGTTGGA 


T T G AAG CAT A 


7320 


AGGTTCGTTC 


TTTCCCAAAT 


GAACTCTCAG 


GTGGGGAGCA 


ACAGCGGATT 


GCGATTGCGC 


7380 


GTGCAATTGT 


AAATAATCCC 


AAAGTATTGA 


TAGCTGATGA 


GCCAACAGGA 


AATCTGGATC 


7440 


CGGATAATTC 


ATGGGAAATT 


ATGAATCTCT 


TGGAACGGAT 


TAACyTACAA 


GGAACAACTA 


7500 


TTTTGATGGC 


GACTCATAAT 


AGCCAGATTG 


TAAATACCTT 


GCGCCACCGT 


GTCATTGCCA 


7560 


TTGAAAATGG 


CCGTGTCGTT 


CGTGACGAAT 


CAAAAGGAGA 


GTATGGATAC 


GATGATTAGT 


7620 


AGATTTTTTC 


GCCATTTATT 


TGAAGCCTTA 


AAAAGTTTGA 


AACGAAATGG 


TTGGATGACA 


7680 


GTAGCTGCTG 


TCAGTTCAGT 


CATGATTACT 


TTGACCTTGG 


TGGCAATATT 


TGCATCTGTT 


7740 


ATTTTCAATA 


CAGCGAAACT 


AGCTACAGAT 


ATTGAAAATA 


ATGTCCGTGT 


AGTAGTTTAT 


7800 


ATCCGAAAGG 


ATGTGGAAGA 


TAATAGTCAG 


AC AAT T G AAA 


AAGAAGGTCA 


AACTGTTACA 


7860 


AATAATGACT 


ACCACAAGGT 


ATATGATTCT 


TTGAAGAACA 






loon 


ACCTTTTCAA 


GTAAAGAAGA 


ACAATATGAA 


AAATTAACCG 


AGATAATGGG 


AGATAACTGG 


7980 


AAAATCTTTG 


AAGGAGATGC 


CAATCCTCTC 


TATGATGCCT 


AT AT TGT AG A 


GGCAAACACT 


8040 


CCAAATGATG 


TAAAAACTAT 


AGCCGAAGAT 


GCTAAAAAAA 


TTGAAGGTGT 


CTCTGAGGTT 


8100 


CAAGATGGCG 


GTGCCAATAC 


AGAAAGACTC 


TTCAAGTTAG 


CTTCATTTAT 


CCGTGTTTGG 


8160 


GGACTAGGGA 


TTGCTGCTTT 


GTTAATTTTT 


ATCGCAGTTT 


TCTTGATTTC 


AAAT AC CAT T 


8220 


CGTATTACCA 


TTATTTCCCG 


CAGTCGCGAA 


ATTCAAATCA 


TGCGCTTGGT 


CGGAGCTAAA 


8280 


AACAGTTATA 


TCCGTGGACC 


GTTCTTGTTA 


GAAGGAGCCT 


TTATCGGTTT 


ATTGGGAGCT 


8340 
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AT CGC AC C AT CTGTTTTGGT CTTTATTGTT TATCAAATTG TTTACCAATC TGTCAACAAA 



8400 



TCGTTGGTAG GGCAAAATCT ATCCATGATT AGTCCAGATT TATTTAGTCC GTTGATGATT 



8460 



GCCCTACTAT TTGTGATTGG GGTTTTCATT GGTTCATTGG GATCAGGAAT ATCCATGCGC 



8520 



CGATTCTTGA AGATTTAGGT AAAATAGCTG CTTTTATGAG GAGATTGTAA AATCTCCTTT 



8580 



TTTGCTACAA GAGTTTTTGA AAAGAGATGC G C AG AAG AAA AGAGCTTCCA AAGAAGTCCC 



8640 



C C AG AG AAG A CTTC 



8654 



(2) INFORMATION FOR SEQ ID NO: 99: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

TGTCGCGTCA AAAT C ATT AC TATGGCTATG TATAGCCCTT ACTATGACTT GGCTAAACAC 60 

GTTCGCTTTC AAATTTCTAG GCTCAGGCTG AAACAGTCTC CCAGGCTGTT CACTCCCGAA 12 0 

TGCTAAAATC GTTCTTGATC GCTTTCACAT TGTACAACAT CTTAGCCGTG CTATGAGTCG 180 

TGTGCATGTC CAAATCATGA ATCAGTTTCA TCGAAAATCC CATGAATACA AGGCTATCAA 2 40 

GCGCTACTGG AAACTCATTC AACAGGATAG CCGTAAACTG AGTGATAAGC GATTTTATCG 3 00 

CCCTACTTTT CGCATGCACT TAACAAATAA AGAAATTCTT GACAAGATTT TAAGCTATTC 3 60 

AGAAGACTTG AAAC AC C ACT ATCAGATCTA TCAACTCTTA CTTTTTCACT TTCAGAACAA 42 0 

AGACCCTGAG AAATTTTTCG GACTCATTGA GGACAATCTG AAGCAGGTTC ATCCTCTTTT 480 

TCAGACTGTC TTTAAAACCT TTCTCAAAGA TAAAGAAAAG ATTATCAACG CCCTTCAACT 540 

AC ACT AT T CT AATGCCAAAC TGGAAGCGAC CAATAATCTC ATCAAACTTA TCAAGCGCAA 600 

TGCCTTTGGT TTTCGAAACT TTGAAAACTT CAAAAAACGG ATTTTTATCG CTTTGAACAT 6 60 

CAAAAAAGAA AGGACGAAAT TTGTCCTTTC TCGAGCTTAG CTGACTTCAA CCCACTACAG 720 

TTGACAAAGA GCCTAATTTC CATAAAAATT GACATGGAAA TTATAAAACC ATTACTAGTT 780 

TAGTCCTTTT TGATAACGTG CCAATTCGGC TTGGTTCGCC CAAACATAGT GACCTGGACG 840 

GATTTCTACC AT AG ATGG C T TATCAGTCTC ATAGTCGTGT TGACTTGGAT CGTAAACCTT 900 

C AAG AC CTTC TTACGTTCCA AGATTGGATC TGGGATTGGT ACCGCTGAAA GCAAGGCTTG 9 60 

AGTATATGGG TGAATTGGAT TGTTAAACAA TTCTTCTGTT TCTGCAACCT CTACAATAAC 1020 
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ACCCTTGTAA ATAACTGCGA TACGATCTGA AATAAAGCGA ACAACCGACA AGTCATGGGC 1080 

GATGAAGAGA TAGGTCAGGC CGAGCTCTTT TTGGAATTTT TTGAGCAAGT TCAAGACTTG 1140 

GGCACGTACA GAAACGTCCA AGGCTGAAAT TGGCTCATCT GCAATAACAA AGTCTGGTTG 12 00 

CATGACCAAG GCACGGGCAA TACCGATACG TTGACGTTGA CCGCCTGAGA ATTCATGAGG 12 60 

GTAACGAGTC AAGTGCTCAG CAAGAAGACC TACTTCACGG ATAATATTTT GAACTTTCTC 1320 

TTTACGTTCT TCTTCATCCT TAAATAAACG GTGATTGTAA AGACCTTCAG AAATAATATA 13 80 

ATCAACAGTC GCACGTTCAT TCAAACTTGC GGCAGGGTCT TGGAAAATCA TCTGGATTCG 144 0 

ACGAATCAAT TCCGCAGCTT GTTCACGCGA TTTCTTACCA TTAATCTTTT GACCATCAAA 1500 

AATGATATCT CCATTACTTG TATCATTTAG ACCGATGATA GCACGACCAA TAGTTGTTTT 15 60 

CCCACTACCG G ACT C AC CT A CAAGCGAGAA AGTTTCTCCC TTGTTGATAA AGAAGTTAGC 162 0 

ATTTTTAACC GCGACAAACT TCTTACTTCC TTCACCGAAG GAAATTTCTA AATCTTTGAT 1680 

TTCTACTAAT TTTTCAGACA TTTCCTTCCT CCTAGTCAGC CAGATGGGCA AATCCCATTT 1740 

TTTCACGGAT CTTATCATGG AGATTTGCAA TCACAGCTGG TTTTTCTACT TTCGGAGCAT 1800 

CCTCATGAAG AAGCCAAGTT TTAGCCCAAT GTGTCTCTGA TACTGAGAAT TGAGGAGCTT 18 60 

TTTGTTCGAA GTCAATCTGC ATTGCGTAGT CAGAACGCAA GGCAAAAGCA TCCCCTTTCA 192 0 

GGTCAGTATA AAGTGACGGA GGTGTTCCTG GGATTGAGTA AAGATCCCCT TT AT CAT C AG 1980 

CAAGCTGAGG CAAGCTAGAC AAG AG ACT C C ATGTATATGG ATGGCGAGGG T C AT AG AAG A 2040 

CTTCCTCAAC CGTTCCATAC TCAACGATTT CTCCTGCATA CATAACCGCT ACCTTATCCG 2100 

CAATACTTGC CACCACACCA AGGTCGTGGG TAATAAAGAT TGTTGTGAAA TGATACTCGT 2160 

TTTGTAAAGA TTTTAGCAAA TCAATAATCT GAGCTTGAAT AGTTACATCC AAGGCAGTTG 222 0 

TTGGCTCATC ACAGATCAAG ACATCAGGTC GGCAGGCAAG GGCAATAGCA ATAACGATAC 22 80 

GTTGACGCAT TCCTCCAGAA T AT TGGAATG GGTATTCATT AAAACGTCTA TCTGCGTCTG 2340 

GAATGCCAAC CTTATTCATG TAGTCAATGG CCAATTCTTT CGCTTCTTTA GCTGTTTTTC 24 00 

CTTGGTGTTT TACAATAACT TCTGTAATCT GACTACCAAT TGTTTTAATG GGGTCCAAAC 24 60 

TAGTCATTGG GTCCTGGAAG ATAGTCGCAA TCTTAGCACC ACGAATTTGT TCCCAATCCT 2 52 0 

TGTGAGAAGA TAAAGCTGTC AAGTCCTGAC CACGGTAGTC AATACTACCT TGGGCAATAC 25 80 

GACCATTTTC TTCGAGCATA CCTGTGAAGG TCTTTGTCAA AACAGATTTA CCTGATCCTG 2 640 

ACTCACCTAC CAAGGCTAAT ACTTCTCCTT CGACTAGTTC AAGGGAAACG CCGCGAATGG 27 00 

CTGTCAATAC TTTGTCACGA ACGTCAAATT CCACGACAAT ATCGCGAGCA GTCAAAATTA 27 60 

CATTTTTTTC TTTTGTCATT TCTACTCCTA TCTATGTGTA CGTGGATCAC TAGCATCCGC 2 820 
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TAAGTTTTGA CCAACTACGA AAAGGGACAA GGATACCAAG ACAAGGGTTG TCAATGGAAT 2880 

CCAGAACAAG TAAGCATTGG TTGTTACGTT TTGTGAATAA TCCGAAATCA AACGACCCAA 2 940 

ACTTGGCACT GTAATCGGTA AT C C AAGACC GAAGAAAGAC AAGAAGGCTT CGTATGAGAT 3 000 

AAAGCTTGGA AGCATTTGAG TCATGGTTGT CACAATAACA GATACCAATT GAGGCATGAT 3060 

ATTTTTGGCA ACAATCTTCA AGGTTGGTGT TCCCAAAGTA CGTGACGCCA AGTTGTATTC 312 0 

CAAGTCACGA TAGCGCAAGA TTTGCACACG GATCATGAAG GCAATACCAA TCCATGTTGT 3180 

TACGCTCATG GCAAAAATCA GATTCCAGAA TCCAGCTCCG AT T G AGTAAG TCAAGACAAT 3240 

AAC AAT C AAA AGAGGTGGGA TGTTTGAGAT GACGTTGTAA ACTTCCATCA TGACACGGTC 3 3 00 

AACTGATTTT GAAATACCCC AAAT AC C AC C GACAAAAACA CCGATAACCA AGTTAATCAC 3 3 60 

TGTCGCAATC ACAGAAATGA GGATGGAGTT ACGAGCTCCG AACCAGACAC CGTCAAAGAG 342 0 

CGATTTACCG TTACTGTCAG TACCGAACCA ATGCTCCGCA TTTGGCTTGA TATAACGAAC 3 4 80 

ACTAAAGTCG TTTACCTTGC TGACATCATT GAAATCAAAC T T AG AAAAC A TTGGGTAGAT 3540 

GAAACTTATC AAAATGATGG CTACCAAGAT TCCCAACATG ACTACAGTTG ATTTTTTCTT 3600 

CATAAATTGT TTAAACACTG ATTTC C AGT A AGAATATGCT GGCGCATCAA TAGTTTCAGA 3 660 

GGCAAAATCG TCACGTTTTA CAAACTGAAA TTTTTCTTTA TCGATTGTAG ACATTATTTG 3 72 0 

CCTCCTTTCT CAGTCAATTT AATACGTGGG TCAATAATAG T CAT C C AAAT ATCTCCCAAA 37 8 0 

AGACGTGAGA AGATAGAAAT ACATGTAAAG ATGAAGACAA GACCAACGAC CATAGAGTTA 3 840 

TTAGATGCTT TTACAGAGTC AAT C AAC ATT TTACCCATAC CTGGGAAGGC GAAGACTGTT 3 900 

TCAGTAAGGG TTGCACCACC GATAACCCCA ATAATGGCAG CAGGAATTCC TGAAACCAGC 3 9 60 

GGAACCATGG CATTTTTAAA GATGTGTTTG TTTGAAATTT CTTTTTCAGA CAAACCTTTT 4 02 0 

GCACGAGCGA AACGAACAAA GTCTTGAGAT TGCAAGTCAA TCATGTAACG ACGAATCCAA 4 08 0 

ATGGCTGTAC CAGGAGCACC CAACAAACCA AGGATGACTG CTGGTAAAAC GTAAGAACGC 4140 

CAATCTCCAG CTCCCAAGAT AGGGAATGAA TCTGGAAGGG CAATAGATGA TCCAATCAAT 42 00 

CGAACGATGT AAACCAAGGC AATCGTTGGA AGAGCAAGCA AGAAGGTCAA AGCCCCTGTT 42 60 

GAGAGGCTAT C AAT CCAAGT GTTCTTGAAA CGAGCCATGG CTGAACCAAG TGGCACGGCA 432 0 

AGAGCATAGG CAAGAACCAA ACCAATCAAA C C AGT AAT AG CAGAGCTGAC AATCATAGAT 43 80 

GGATATTGGT AATTACTTTC AGTCGCTGTA T AAGG AT CAT CTTTCCCATA GCTAGCTACT 4440 

TCACGAGAGT CAGCCTGACT AGGTGACTTG TAGGTTCTTG AGT AAAT ATT TACAGAAGAC 4500 

GTTTTCTTAC CTGTTGGGAA CTGAACTTGG GCAGTTTTGG TTTGTCCTTG ACCTTGAGTA 4560 
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ATAACCTGAA GAACTGGTGT AT TAGC AT AG GTTGGGTAAG AGTCACCTAA ATTCAAGTTC 462 0 

ACAAAGTTTT GATGAACAAA TGGGAACTGA CTGTTAAAGT ACAAGAGATA TTTATGTTTA 4 680 

GTTCCTGAAC CGACCAATGA CCATCCGATA GCTGGATCAT TTTCAAAACG AAGGTAGCGT 4740 

TTCAAGTCTG GATTTTCAGG GTCTTGGATT TTATTTGTAT GGTCAATGTC AATCAAGTTA 4 800 

GCATAGAAGT GAAAAACACG TTCAAAAATT GGAATTTCAC GAGTAGCATA GAATTGACCA 4 8 60 

CTTTCAGTAA ATTCTCCCAA AGTCCAACCA TGACCTAATT GATTGATGTA CTTTTCATAA 492 0 

ATAGCTTTAT TGGTCGCATT TGCTTCTACT GTTACAGAAG AATCCATGCT ACTTGCCTTT 498 0 

TCTTGCAACT CTTTAGTATC GTAATACTCA ATGTAGCCCA TACGCTCAAA CACAGTATTT 5040 

TCATAGTTAT CACGTTTATC AGCCGTTGTC GCAATTTTAT TATAGTTAGG ATCCTGCTTG 5100 

AAAATCAATT TTCGAGGAAC CAAGGTATAG ATAATCGTGT AGGTCAAAGT CGTTACTAAG 5160 

AAAATCGAAA CCAATGACCG CAAAACACGC ATAAAAATAT ATTTTTTCAT ATTATTTCCT 522 0 

TTAAAAATCC CAAAAGAACC TTCTCCTCAT GGAGAGAAAG TTCTATTAGA AATTATTTAC 52 80 

TTCACATGAC TTGCCAATTC TTTTTGAGCT TTCTCATTTG ATTCAGCTTT TTCTTTCAAC 5340 

CATTTTTCAC GAGCTTTTTC ATACTCTTCC TTAGTCACCA CTTTATCTTG TGATTTCAAA 54 00 

TATTTGAAGT AAACATCTGA CCCCTTAGAG CCTGTTTGCG CAGAAGCTCC AGTAAATGGA 5460 

ACAATTCGTG AAAGCACTGG TGCTGCACCA GAAGAAGCCA TAGCAGGAAT AAAGAGTGAA 5520 

CTATCTGTCA ACCATGCTTG AGCCGCTGCA TATTTTTCAT AACGGACATT CAAGTCGCTT 5580 

GTCTCTCTGG CAGCTTCATC AACTAATTTA TCGTATTCTT TCAAACCAAC TTGAACTACT 5 64 0 

GAAGGGCTAT TTGGATTATC AAATCCTAAA TATGTTTTTG TAGTTTCACT GCTAGTTGTT 57 00 

TTTAAAATAT CCAGGTAAGT AGATGGGTCT TGATAGTCTG GCCCCCATGA AACTCCTCCT 57 60 

GAT AC AT CCC AATCCTCAGA TGAAGCATTG GCAGCATAGT AAGTAATATT AAGGAATTCA 582 0 

TCACTTGTCA TTTGTTGAAT ATCAACAACG ACATTTTCAA CACCAAGAAC TGTTTCTACA 5 880 

GATTGTTTAA AGGACTGAAT ACGAGATATG TAGTTTTTTG ATGCTTGGTC TACTGGAACG 5 94 0 

TCCAGATGAA TAGGAAACTG AACGCCGTCT GCTTCTAAAG CTTTCTTAGC TTTCGCAAAC 6000 

TCTGCCTTGG CCTTGTCAGC ATTGAATAAA CCATCCTGCC CATCAGCTAA ATTCACACCT 60 60 

TTCCACTCAT CACCATAAGC AGGAAGTTGA GCAGCGACTA AATCACCAAA GGTCTTCTCA 6120 

CCAGCTGAAA CAAAGTCTGG TTTTACAAAT AAATTACGAA CTGCTAAAGC TGCTCCATCT 618 0 

TTACCATTGA TTTGAGCTGA GTAAGCTGAG CGATCAAGAG CAAAATTCAA GGCTTGACGG 6240 

AAATCTTTGT TAAGCAATGC CTTCTTAGTA GCTACTTTCT CTGAATCTGT AGTTTTAGAA 6 300 

GTATAGTTGT AACTTTGGCG ATCAATATTC ACACCCAGAC CAGCAATCCC AG AG C C TG AT 6360 
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TGTGTGTAAT AG AT AT TGT C CTTGTATTCT TCTGCAACCT TAGAATAGTT GGAGCTGGTA 6420 

GGGTAAAGAC GGGCATAACT ATAAGCTCCA CTAGTGAAGT TACGCTCTAG CGACTCCTGA 6480 

TCTGATCCAT CATAGTAAGC TAGATTGATA GTATCTAGGT GGACATTTTC TTTATCCCAA 654 0 

TATTGCTCAT T TTTT AC AAA C T C T AC AG AA GATTTTGCAG TCAACCCTTT CAACAAGAAT 6600 

GG AC C ATT AT AAAGCAAGGA TGTCGGATCT GTTGGTTTAG CAAAATCGCT TCCTTTTGAT 66 60 

GTTTCGAATT CTTCATTCAG AGGCCAGAAA ATAGAATAGG TCAACTTAGA GTTCCAGAAC 6720 

GGTTCAGGCT GGTTCAAAGT GTATTGTAAC GT AT AAT CAT CAACCGCCTT GACACCAACT 67 80 

GTTGAAAAAT CTGTTGAAGT TCCTGATAGA TAATCTGCCA AGCCTTTAAC CGAATTTTCA 6840 

GCTAAATACA TAGCTTCTGA TTTTTTATCT GCTGCGTGTT TTAAACCGTT CACGAAATCT 6900 

TTAGCCGTCA CCTCTGCATA TTCTTCTCCA TCAGAGGTAA ACCATTTAAC CCCTTTACGA 6960 

ATCTTATAAG TGTAGGTCAA ACCATCCTTA GAGACTTCCC AATCCTCTGC AACTGCAGGA 702 0 

GCAAGATTAC CGTAATTATC GTTAGTGAAT AAACCATCAA TCCCATTTGA AGTCACTACT 7 080 

GT TGT ACT AT TTTTACTTGA AATCAGGTAG TCCAAGGTTT CTGGGTCTGC TGTATAAACA 714 0 

TAG CC AT AAG CTTTAGGGGC TGATGAATCA GAT G ATTTTG AAGAACTGCA TGCTGCAAGT 72 00 

ACACCTGCTG CTAATAAAAC AAGACCTGCT GTAGCAAATA CACGATTTTT TTTCATTTTC 72 60 

TACTCCTCTG TTTATGTGAA TTATAGATTG ACAACCATTA TATCACATTA TCCATTAAAA 7 32 0 

ATCAAACAAA TTTT C AGAAT ATTTAGGCTT GTTGGCACAA ATTTTTCATT TTTTTTGAAT 7 3 80 

ATATGATTCA AATTGTCGTT CGAAGTGTCA AAG AC T AC AG TGAAAATAGG AAATTTGACG 7440 

CAGAAACTTT GGAGTTTAGG AAGACATACA GTAAAATGAA ATACGG AC GG AACAATGTGA 7500 

TTTTGGAATT CAAATTAAAT TATAACAATA TTGTAGAAGT ATCATTCTAG TATTCAAGAT 7560 

TCAGTTTACT ATGTCTTTTC ACACCAACCT TAT C C CG AAT TCAATTACTT TTGTGATTTA 7 62 0 

CATATATAGA TTAAGACTAT CTTTTATACT TTAAAATTTC TCGCTACCTT ATCCACTATA 7 68 0 

TGCTCCTCGC TATCACGTTT CT ATT CAT AG CCTACGATTT CACTATTGCT TTCTCTGACA 7740 

ATTCTTATTT CCTGCGTCAG AC T T AAAACG ATCTATCCCC AGACCATTTT AATCCGCTAC 7800 

CTCACGATAG TCAGGCTTGG GGAGCGCTAT TGTATTCACC GGTAGTGGAG CCCTACAGAG 7860 

GACTTACACC TCAGATGCAC GACATGCCCA TCGTATAAAA AATCTCCTAC C C AAGGT AG A 7 920 

AGATTTCAAA CTTATAAAAC TTAATCCGTC ATGTCCGATA CCAACATTCG ATGCTCCAAT 7980 

GGAATACTGC ACATAACTAG CAAGAAAATA AAGCCTGACT GAATCCAGAA GAGAGCCAAG 8040 

TCAAAAATTC CGTGCACAGC AACCACTGTA AGGAAAGATA G AT AAAGGC C GATAATCGGA 8100 
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CGTTTCCCCG ACTCCTGACT CATATCCATC ATCAAGCGAA CAGGAGCAAC AGAAGACAAA 8160 

ACTAATAAAA TAGTCCCCAC AATTCCGTAA CTCAGAATCG TATCAATATA AAGACTGTGG 82 2 0 

GCATGTTCAT GATAAGGAGC ATGTATCCGA GGATAAGAGT TCATATAGGT CAATGGCCCT 82 80 

TCACCCCAAA AAGGATTTTG CTTAAACAAG GCCATCCCAG CATCCCAGAT AGAAATGCGT 834 0 

TCTTCCATAG AAGAGTCTAA AGTACCCATT CGAACTCCCA AATCACTAGA AAAGAGGAAA 84 00 

CTCAAACCAA TCGCGAAGAC CCCAATACTA AG C C AAAAGG CCTTCCAGTT TTTAATAGTC 8460 

GTAAAGAGAT AGATAATTGC TCCAGCGATA ATAGCAGGAA AGGCAGTTCG ATTTTGAGTA 852 0 

AAGTTCAAAC CAAAGAGATT AACAAAGCCT GCAATCACAC AG AAT ACT T T CAACCAATTC 8 580 

AACTTGGTCG TTGTAAACAG ATAGAAAGCA ATCATAATAC AGAAACAACA AATAATTCCA 8640 

TAATAATTAG GATTAAAGAA GGTCACTTCT GCCCGGTTCT GATGCCACAC CTGCATATTG 8700 

GGTGAAAGAA AAGCATAGTT AAATTTCTTC AC AAT T T GG A AATGTTCTAA ACTGGCAAAA 87 60 

GCAGCTGACA AGACACTACC AAACAAGACA AACTGCAAAA TCAATCGAAA GAATTTATGG 882 0 

GATAAAATCG ACTGATAGTG CAAAAAGAAA ATAGTAAATA GAAACATTCC TACTGAAGCC 8880 

ACAAGACCCA TCCAATTTTG TGCAAGAATG GATATAACAG TACTATAGCT AAGAAAAAGA 894 0 

AGCAGCATCG GATGCTCCCC CATTTTCTGA AGAATACTTT TCATGTCTCC TGTAAAAATC 9000 

AAACTGATAA TATATAAACA GAGTACAACT ACAAAAAGAT AAAAGGGTAA AAAGATACTC 9060 

AGGATAATTC CCAATAAAAT CAGCTCTTTA CTAGACAACC CCTTCAGCTT TTCAATAAAG 9120 

CCTATTGATT TCAAAATGAA TCCTTTCTCT CCAAATCAGC TGATTCAGAT AATAGTAAGC 9180 

TATCCTATAT TGTACCACTT TTTTAGCAAT TTGAAAACAA AGGAAACGTT TTCCAAAATA 9240 

AAAACCCTAT TTTATCCACC ATATCAAGGC TTCAAAATGA TACTTCAACT CCATTCTCAA 93 00 

TTACCCGATA AGTCTGATTT TGCAAATCAA T T TC T ACT AC TGCTGTTACG GACTTATCTT 93 60 

TATTTTGACG TTTGATTACA ATGCTGTGAG CTGTTGGTGT CTCTATCTCA GTAGTCCCTT 9420 

CTAGATCAAA GGCTTCTGAA CGGTTACGGA AAGAAAATAG ATTGAGAAGG GCCTTCACAA 94 8 0 

CAGGTCGTTG CACTTCTTTT GCTATTTCCT CGTTGCTATA GTAATGACGA TTAATATTTC 9540 

GACCTTCTTT AGTTTCTTCT AATAATTTCA AGTCATTCTT GCCTGCTAAT AGACCCACAT 9 600 

AGTAAATCTG AGGAATACCT GGGGCAAAAG CTTGAATTAG ACGAGCGAGA AAAT ACT TG A 96 60 

CATCATCATC TCCAAGCGCT GAATAGTAGG TTGAATTGAT T T GGT AG AT A TCTAAGTTGT 9720 

TATACTCGGC ACTAGAGTAC TTACGTTTGA CATTGGCTCC AACCTTATAG AGTTCATTTG 97 80 

AAGCATAGTC AATCTCCTCA TCGGTCAGGA TATCCTTGAC AT C TACT AC T CCAATCCCAT 9840 

CATGGGTATC TAGCGTCGTA AATTGCTTCA TCGGGCTCAT CTTTAACCAC TTAGCCAAAC 9900 
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GCTCTGTTCT GGAACTGTAA AGAGTATAAA GTGTCACCAT TGGAAGAGCA AAATCATAAA 99 60 

CATAGTAATC ATGGTCTGCT ATTTTAAACT GAATCGAATA GTGTTCATGA ATCTCAGGTA 10020 

AAAGCTCTGT CCCATACTCA G C AG CG AT AT CTCGAACTTT GTCCAATAAA TCCCAAATAT 10080 

CTGGTTCCAC AAAGAAATCA TTAGTATCCA ATTTCTTCAC TGCATAAGCA AAGGCATCTA 10140 

GACGAATCAA ATCACACCCA TTACTTGCCA AGTGCTGAAT GGTCTTACGG ATAAATTCCA 10200 

TAGTTACTTC TTTGGTCACA TCAAGATCAA TCTGCTCCTC ACCAAAGGTA TTCCACAAAT 10260 

GTTCCACTGA AC C AT CTTC A AACACAATCT CTTGCTTTGG TGCACGATCC TTACGCTTGT 10320 

AAATTAAATC T AC AT C AG AC TGTGTCGGAC GGTTTTCTGG CCAAAACTTA TCCCAGTTTA 10380 

AAAAGAGAGC TTTAAATTCA CTGGCTTCAT GTTTTTCTTG ATAGTCCTTA TAATACTTGG 1044 0 

ATTGACGAGA AATATGATTA AT C AT AAAAT CAAACATAAG ATAATATTTC TCACCTAAAC 10500 

GCTTCACATC CTCCCAATCA CCAAAAGCTG AGTCCACTTC GTCGTAGTCA ACTGGCGCAA 105 60 

ATCCACGATC AACTGTTGAT GGGAAAAATG GTAAAAGGTG AACTCCTCCA AT AG CATC T C 1062 0 

CAAAATGCTC TTCCAAATTA TCATATAAGT CTTTAAGATT ATTTCCAAGG CTATCAGAAT 10680 

AGGTAATCAA CATGGTTTTA TTTTGAATTG G CAT CAT T AC TCTCCTTTTT CTAATTGAAG 10740 

CCAAGTCTCA TATGATCTGG CTTCATAAAT AAAAT T C ATT TTAAATCTCT ATTTATCATC 10800 

AAACTCGTAC TAATATAGAC TGTGATAAAC AAAGTACTAC TTTCTTGTTT TCTGCATAGA 108 60 

ATTATCAACA AGCTAAACTC TTCCTCTGTG T C AAAG ACT A TAGATTCCAT GAGCTCTTCT 1092 0 

TATACTCTTC GAAAATCTCT TCAAACCACG TCAGCTTCAC CTTGCCGTAG GTATGGTTAC 109 80 

TGACTTCGTC AGTTTCATCC ACAACCTCAA AACAGTGTTT TGAGCAACCT GCGGCTAGCT 11040 

TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTACTTCAC TGCCCCGTTG CTCATTCCTG 11100 

AAATGATATG GCGTTGGAAG AAGAGATAGA CAATGGTGAT ACTGATAATG CCGACCACGT 11160 

AAGAGGCAAA GCTTGGTCCG TAGTCGTTGA AATATTGGCC TGCGTAGTTG TATTGGAACA 112 2 0 

AAGGCAGAGT CCACATTTTG GAATCCCGGT TCAAGACAAG GAGTGGCAAC ATGAAGTCAT 112 80 

TCC AGAAC C A AAGGGCATTG ATGATCATGG TTGTCGCATG CATCGGTTTC ATCATTGGGA 11340 

AGATGATGCG GAAATAGGTT GTAAATTGAT TAGCCCCATC GATCTCTGCT GCTTCATCCA 114 00 

GACTTTCTGG AATCGAG AT T TTGATATAGC CAACATAGAG AAAGAGGGTC TGTGGAATCG 114 60 

CATAGGTCAA GTAGAGCAAG ATCAAACCAA AGGTATTAGC CAAACCGAGT TTACTCATCA 11520 

TAACCGTAAT CGGAATCATG ATG AC TTGG A AAGGTACGAA GATTCCGAGG AT T AAGAGGG 11580 

TATACATGAT GGTAAAGGCT TTTCTTTTAC TCATATTGCG AGCGATGGAG TAGGCTGCCA 1164 0 
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TAGGGATAAA GATCATTACT GCAAGTAAAG ACAAGACAGT GATGACGACA GAGTTCCAAT 11700 

AATAGCCTCC AAT C C CAT C A GCTAAGAGAC GGCTAAAGTT GTCCCATGTG AAGTTGGTTG 117 60 

G AAAGC C AAA GAAATTATCT ACAATATCCT TAGTGGGTTT GAAGGAACTA AAGAGGGTAG 11820 

CAAGGAGCGG CACTAAAATC AGAACCGATC CTAGAATCAA TAG AAT GT AT TTGCCAATCA 11880 

GGGCTTTTCT TTCATCTTGT TTCATCATGC TTCTCCTCTT AAATTTCAAA TTTCTTAGAT 11940 

ACTCTCAATT GGATGATCGA AAT C ACT AC A ATTAAGAAGA ACAAGATTAC GGCAATGGCA 12 000 

TTGGCATAAC CGAATTGGTT GTTTTTAAAG GCATAGTTAT AAACCAAGAG CCCAAGTGAG 12 060 

GTTGTGGCAT TGTTTGGACC ACCACCGGTC ATGGCAAAGA CTTGGTCAAA GGCAGTCAGC 1212 0 

CCACCTTTTA GGGCTAGGAT AAAG AC CAT A GAGACACTTG GTAGCAAGTA AGGCAATTCA 12180 

ATGTT C C AG A AAACTTGCTT GCTAGTCGCA CCATCAATCC TTGCTGCCTC TGTAATCTCA 12240 

GTTGGAATAG ATTGCAAACC AGCTAGGAAG ATGATGATGG G CAT AG C C AC CCCTTGCCAA 12300 

AGAAGGACAA AGACAGCCGC AAAGATTGCT CCCCACTTAG TCCCTAAAAG ACTGGTTTGG 123 60 

AAAAATTCAA TATGAAGGGC ATT T C C AAT C GCTGGAAGAC CGTAGTTGAA GACTTGCTTG 1242 0 

AAGATCAAAG CCACTGTCAA ACCAGATAAA ACAGCTGGGA AGAAGAACCA AGCACGGAAG 124 80 

AAGGTTTGGC CTTTGATTTT AGAATTCAAG ACACGCGCAA TGAAGATCCC GAGTGCAATC 12 540 

TCACCAACCA CCATGGCAAT CGCAATGATT GCGGT AAAGC CAATCGCATT CATGAATTTT 12 600 

GGATCCATGA AGAGGAGCTT AAAGTTGTTT AAGCCAACAA ATTTGTAGTT ATAAGTCAAT 126 6 0 

CCTGTCCAGT TGGTAAAACT GTAAAAGGCT CCTTGAAACA TCGGCACATA GAAGAAAATT 12720 

GCTTGTAACA AGAGGGGGAT GACCACAAAA GCCCATGCCC AATATTTTTG TAATACTTTT 12 7 80 

TTCATAGTCT CTCTACTCCT AATCCACATC CGCTTTCATC GGGTTAAAGA AGGCATTCAA 1284 0 

AT C ATTGACC ATGCCTTGTT TATCACCGGT CAAGACATAG TTCATGGTCA AGGTATGGAA 12 9 00 

GTCTGCTTCA CTGGTCCAGT ATTGTTGCAA CCAGACCAAG TGACGATCCG TAAAGGCATA 12 9 60 

TTCGGTCATA CCAGCAAGCG GTGAATCTTC TCCTGCTTGT TTGACCCCTT CGATCGCTGT 13 02 0 

TGGAGATCCG TCCACATCGT AGTATTTTTG CATGACTTCT GGACGGGTCA TATATTCCAC 13080 

AAAGGCATTG GCTTCTTTTG GATGTTTGGT GGTGGCTGAG AT AG AC CAT G CCAAGTCTCC 1314 0 

CGCACCAACG GTTAAGCTTT GTCCTTTTTC TTTTCCTGGA AT CATG AAGG TCCCAATCTT 13200 

AAAGTTCGGT TTTTGTTCAT TAATCGCTGT GATCGCCCAA GACCCATTTG GTGTCATGAG 13 2 60 

GACATCCCCA CGTGCGAAGG CTCCGATAAC ATCGGTATAG CCAGCACCTT CCCAGTTCTT 13320 

TTGCTTAGAT CCATTGATGC GAAGGATGTC CATGACCTTG ATATCATCTT TCATAATCGG 13 380 

AT C CGAC AAT TTAATGGCAT TTGGTTGAGA ATAACGAAGG TATTGATTTG CTTCTTTTCC 13440 
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TCCACCTGTT GCTGTCGCAA AGGCTAATTG ATTGTAACCA TTGAGTGTCC AAGCATCTGC 13 500 

ACCTGCAATT CCAAATGGTG TTTGTCCTTT AGCAACGATA TCTTTGACTA ACTGTTCAAA 13 560 

TTCATCCCAG GTTTCAGGAA CCTTCAAGCC CAGTTCTTCG AATTTATCTT TGTTGTAGTA 13 62 0 

AATTCCATAA GCATTAGCTG TAAAAGGAAC GTTGTAAACT TTTTCGTTTA CAGCATATTT 13 680 

TTCAGCGTAG CCATTTTTCA CGCGTTTCAG GTAGTCTTTG TTGCTCAAAT CTTCAAAAAC 13 740 

ACCTGCTTTT GCCCATTCTT GCAGTTCGAT GGACTGTGGG TAAATATTGA CCACATCAGG 13800 

CACATCTCCT GCGAGAACGC GTGTCTTCAA TACTTCACCA GCATTTGGTA CAT T G ACG AC 13 8 60 

TTTGACCTTG ATCTTAGGGT TTTCCTTCTC AAAATCACGA GTGATTTCTT CCAAGGTTTT 13 92 0 

GGTCATTTCT TTTTTCTGGT TGAAATACTC G ATGGT C ACT GTGCCATCCG CAGATTTACC 13980 

ATAGTTGGAG CAAGCGCCGA GCCCAAACAA AGCTAAACCT GTAGTTGCAA GAAGTCCGAT 14 04 0 

TTTTTTATAC CATTCCATTA GAAAGCCTCC TTTATAAATT TATACACCCT TATTGAACTG 14100 

CACCCCAAAA GTTAGACAGA ATAAATCTAA CTTTTGGGGT CAGTACATAT CATAGTTTTC 14160 

TAAAAATATA CTGTCTACTC AAAAAATCTC CTTGGGATAA GATAACAGTT AAGCCCGCAT 142 2 0 

ACATTAGTTC TGCACCTGAG TAAACTTCGC CATTTTCCTG TAATTTATAT AGTCCCTCTT 142 80 

CATCCAAATC TTTTAATTTT AAAGTTGTTT CCATGGTCTC TACAACAGAT AAAACGCGAA 14340 

CGTAGGTTAC AATCGTTTGA TTTCCGTAAT TAAATTGTAC AGCTGCTTCA TTGGATACAG 14400 

TATCAGGATT AATTAGTCTA TACTGCTGTC CTAACTGAAC TACTGGTCGT AATTCTTTAT 144 60 

AC AAGT T C AC CTGATTAGCA ATCGTAGCTT TCTCTTCATC TGATAAATTT GTCAAATCAA 1452 0 

GT TC AT AG C C CAAATTTCCC ATCATTGCTA C AAGGCC AC G TGTTTCTAAT GGTGTCATTC 14 580 

GTCCCATCTG ATGATTCGGT ACTGCTGACA CATGAGCCCC C AT AG AAAT G GTTGGATAGA 14 64 0 

GATAGGATGA ACCGTATTGA ATTGGTAAAC GTGCAATGGC ATCAGTATTA TCACTAGCCC 147 00 

AGACTTGTGG GAAATAGCGC ATCATACCAA GATCATTTCG TCCACCACCA CCAGAGCAGG 147 60 

ACTCAAAGAG AATATGGCTG TGCTTCTCTG TCAGATAAGA AACGAGTTCA TAAAGCCCCA 14 82 0 

GCATGTACTG AT GAG AT TGC ATCTGTGTCT CTAGATAAGT TAATCCATTC CCTAGCTTAG 14880 

TGATATTGCG GTTCATATCC CATTTAATGT AATCAATATC ATGATAAAAT AGGAGTTGAT 14 94 0 

CTAAGACACT T T TC AAGT AT TCTACTACCT GAGGATTGGC AAG AT T AAGT ACTAATTGAT 15000 

TCCGAGAATA AGTATGCTCA TAGCCAGGAA CCTGAATAGC CCAGTCAGGA TG T TG ACG AT 15060 

ACAAATCACT AT C T AC AG AA ATCATTTCGG GTTCTAACCA AAGTCCAAAC TGCAAACCTC 1512 0 

TTTCATGGAT AGCTGAAATC AGACTTTCTA GACTTCCACC CAGTTTTTCC TCATTAACAA 15180 
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CCCAATCACC T AAAGC AC G A TTATCATCAA AACGATTGCC AAACCAACCA TCATCTAATA 15240 

CAAAAAGTTC AATGCCAACT TTCTTAGCTT CATCTGCTAA CTCTAACAGT TTTTCTCTCT 153 00 

GAAAGTCAAA GTAAGTAGCT TCCCAGTTAT TGATTAGAAT TGGACGTTCT TTTTTAGAAA 15360 

ATTCACTTAG CATAATGTGC TTCAGTACAA AATTCTGACT TTCATGACTA ATACCAGTTA 15420 

ATCCCTGATC TGAATGAGTC ACTAAAGCTA CCGGTGTTTC AAAGTATTCC TCAGGAGCTA 15480 

ACTTCCAAGA AAAGTTTTCT GGATTAATGC CAATAGCCAC CCGAACTTCA TTCAATTGAT 15540 

TTTTTTGAAC AAAAGCTTCA AAGTTGCCAC TATACATTAG TTGAATAGCA AAC AC AT TC C 15 600 

CAGCATCCTC TGTGACTCCT TGTTCGCATA GTAGAAGAGC TGGTGTTTGA GCATGACCAG 15 6 60 

AAGCACCTCG GTTTGAACTA ATCGAAAAGA TTCCTTGTTC TACCTGTTGA CGTCTAACAG 1572 0 

TCTTTTCACG AGCATAAGCA CCCTGCAGAG TTACTATTTC GTAATCTGCA GCTGGAAAAT 157 80 

CAGCCATAAA AGAAAAATCT TTATGGATGA CAACTTCCTG ATTACTATTA TTATCTAATT 15840 

TACTGTAGCT AGCAATAGTC GCATCATTAT TAAAAGTAGT ATAATACAAA GT C AG ACT AA 159 00 

GTTGAGCCTT AGAATCTTCT AACATTAAGA CAAGAGTCTC TGTATCGTCC ATGCTATGTG 15 9 60 

GAGAAGGTAA GCCCTGTGGA CCATTCTGAC CTTTTAAAAT CTTTGCTTCT ACAAATCGAA 16020 

AGTCTGTTAC TTCAGTTACA CTATGCTGAA CCTGTATGGT TGGTTTCCTA AAATCTCCTA 16080 

AGCCATGTTG TCCAAAAATC TGTCGCTGAG TATCTAAACT AAAGGTTCGA TTAGTAGCCG 1614 0 

TTGGATTTCC TGAAAAGGCA TGGTCTCGTT CATAAACACT ATTGGAACCT TTATAGTTCT 16200 

TAATAGTCTT TCCTAAATGT TTCAAAAGTA AGTAGCCATT TCGATTTTCA ATAATCAAAC 162 60 

T T AG ATT TT T ACTCTCAACA TAAAATAGAT TATTCTCTAT CCTAACTCCC ATTTACTTCA 16320 

CCTCATCACT TTATTGATTA TATTTTATCA CCTGAAATCG CTTTCCAAAA TAGAAAAATG 163 80 

TCTCAAGAAT ATGGTAAAAT GTTAGGTAGG AGGTAGCACA TGTTAGTTTT TTCAGAATAC 16440 

CAGACTGGAA CAATCGACCT TGCCCTAAGC TTTTATGGAT ATGAGGAATG CACACCTAAT 16500 

TACTCTTTTG GTCCAGCCAT TCGTGATACA TACGTTCTAC AT T AC ATT AC TAAAGGACAA 165 60 

GGAAAATTTC ATTACAAGGG TAAAATTGTT GATTTAAAAG AAGGAGATTT CTTTCTATTA 16620 

AAACCAGAGG AACTAACCTT TTATCAAGCA GATAGTAAAG AACCTTGGGC CTACTACTGG 16680 

TTAGGAATCA CTGGAGGGAA AGCCCCTGAT TATTTTGCTC TTTCCCAAAT TTCTGATCAA 16740 

TCCTATCTCA TCCAATCTGA AACTTGTCAT ACCCAGACTA CTGCAAAACT CATCTCAGAC 16800 

ATTGTCCGCT TCGCTCAGAT TACAAAATCA AGTGAATTAG CTCAACTCCA TATCATGGGA 168 60 

CAACTTCATG AACTGATGTT TCATCTGGGA ACTATTGCTC CCAATCAGAA AAAAAAGAAT 16920 

ATTTCATCAA CCCACCAACT CTATCTTGAA TGCAAACGAT TAATTGATAG CCACTATCCT 16980 
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CAATCACTTA CAATTCAAGA TTTAGCAAAA GAACTATCCG TTCACAGAAG CTACTTATCA 17 04 0 

AGCGTATTCA AAGAATTTAA T ACCTT AT C A CCCAAAGAAT ACCTACTCTA CGTTCGAATG 17100 

CACCGAGCTA GACAACTTCT CGAAAATACC CAAGAGTCCA TCAAGGTAAT TGCATACTCG 17160 

GTAGGTTTTT CAGATCCACT CCATTTTTCG AAAGCTTATA AACAATACTT TAATCAGACT 172 2 0 

CCAAGTCATA CAAGAAAAGA ATACTCTCAA TACCAACTAG TAAGAAAGGC AACATTATGA 17 2 80 

AATCCTACCA AGCTGTCTAC CAAATCCTAT CTAAAGAAAC CGACTATATC AGCGGAGAAA 17 34 0 

AAATCGCAGA AAAACTATCC CTAAGCCGAA CAGCAATTTG GAAAGCCATC AAGCGACTAG 17400 

AACAAGAAGG CATTGAAATT G AT AGT AT C A AAAATAGAGG ATATAAACTG ATGAATGGTG 174 60 

ACCTTATTCT TCCAGAGATT C T AG AAG AAA ATCTTCCAAT TAAAGTCAGC TTTAAACCCG 1752 0 

AAACAAAATC AACACAACTA GATGCAAAAG AAGCAATTGA TTTAGGCCAT GAAGCAAATA 17 5 80 

CCCTCTATCT AGCTTCCTAT CAAACAGCAG GCCGAGGCCG TTTTCAACGT TCCTTCTACT 17 640 

CACCACAAGG TGGTATTTAT ATGACACTCC ATCTTAAACC AAATCTCCCC TATGACAAAT 177 00 

TACCATCCTA CACACTACTT GTAGCTGGAG CTGTCTACAA AGCCATTAAG AACCTAACTT 177 60 

TAATAGATGT CGACATAAAA TGGGTCAATG AT AT C T AT C T AAACAATCAT AAAATTGGAG 17 820 

GAATCCTTAC TGAAGCAATG ACCTCTGTAG AAACTGGCTT AGTCACAGAT AT C ATT AT TG 17880 

GAGTAGGTAT CAATTTCACT ATTAAAGACT TCCCTCAGGA ATTAAAAGAA AAAGCTGCCA 17 940 

GCTTATTTAA AGCTACAGCT CCTATAACAA GGAATGAATT GAT C AT AG AA ATCTGGCGTG 18000 

CTTTCTTCGA AACACCAGCA GAAGAGCTAT TATACCTATA CAAAAAACAG TCATTCATTC 180 60 

TAGGAAAAGA AGTCACTTTC AC ACT AG AG C AAAAAG AC T A CAAGGGACTT GCTAAAGACA 1812 0 

TCTCAGAAAA TGGAAAACTT TTAGTTCAAT GTGATAACGG AAAAGAAATC TGGCTAAATA 18180 

GTGGCGAAAT TTCTCTCAAT AGTTGGAAGT AAAATAACAC AATTATAATA T AAAC GAT AT 18240 

AAAAATAACT TC AG ATT AGT AATTCAATTA AGTTTTACGG ATCTGAAGTT TTATTGGCTC 18300 

TAAAAATAAA AAAGAGAGTT ACAGACTCTC ATTAAAACGG AGAATAAGGG ATTCGAACCC 18360 

TTGCGCCAGT TACCCGACCT AACGATTTAG CAAACCGTCC TCTTCAGCCT CTTGAGTAAT 1842 0 

TCTCCAATTA ATGGGCACGA GTGGACTCGA ACCACCGACC TCACGCTTAT CAGGCGTGCG 18480 

CTCTAACCAC CTGAGCTACG CGCCCAAGTT AAAAAACTTG GTAATTTGAA CAAAGTTCAA 1854 0 

AGCGGGTGAC GAGAATCGAA CTCGCGACAA CAGCTTGGAA GGCTGTAGTT TTACCACTAA 18600 

ACTACACCCG CATAAATACT ATCAATAAAA TGGCGCGAGA CGGAATCGAA CCGCCGACAC 18 660 

ATGGAGCTTC AATCCATTGC TCTACCAACT GAGCTACCGA GCCTTATTGC GGGAGCAGGA 18720 
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TTTGAACCTA CGACCTTCGG GTTATGAGCC CGACGAGCTA CCGAGCTGCT CCATCCCGCG 18780 

TTAATAATAT AAAAGGAGGA TGTGGGATTC GAACCCACGC ACGCTTTTAC ACGCCTGACG 18840 

GTTTTCAAGA CCGTTCCCTT CAGCCGGACT TGGGTAATCC TCCAATATTC AAATGGACCT 18 900 

TGTAGGACTT GAACCTACGA CCACTCGGTT ATGAGCCGAG AGCTCTAACC AGCTGAGCTA 189 6 0 

AAGGTCCGAC AAGATCATTA TAGCGGCGAA GGGGATCGAA CCCCCGACCT CCCGGGTATG 19020 

AACCGGACGC TCTAGCCAGC TGAGCTACAC CGCCATGAAT CGGGAAGACA GGATTCGAAC 19080 

CTGCGACACC TTGGTCCCAA ACCAAGTACT CTACCAAGCT GAGCTACTTC CCGAGTTAAA 19140 

TAGAAAAATG CACCCTAGAG GAGTCGAACC TCTAACCGCC TGATTCGTAG T C AGGT ACT C 19200 

TATCCAGTTG AGCTAAGGGT GCTCCATATT ATGCCGAGGA CCGGAATCGA ACCGGTACGA 192 60 

TCGTTACCAA TCGCAGGATT TTAAGTCCTG TGCGTCTGCC AGTTCCGCCA CCCCGGCCTC 19320 

TCTAAGCGAA CGACGGGATT CGAACCCGCG ACCCCCACCT TGGCAAGGTG GTGTTCTACC 19380 

ACTGAACTAC GTTCGCACTG TTTTCTTCTA TCTAAAAATG CCGGCTACAT GACTTGAACA 19440 

CGCGACCCTC TGATTACAAA TCAGATGCTC TACCAACTGA GCTAAGCCGG CTCATTTGTT 19 500 

ATATCTTAAT GCGGGTTAAG GGACTTGAAC CCCCACGCCG TTAAGCGCCA GATCCTAAAT 19560 

CTGGTGCGTC TGCCAATTCC GCCAAACCCG CATATATGAC CCGTACTGGG CTCGAACCAG 19 620 

TGACCCATTG ATTAAAAGTC AATTGCTCTA CCAACTGAGC TAACGAGTCT AAAATAACTT 19680 

GCGTTACCTT AAACGGTCCG ACGGAATCGA CCCGGTAC 19718 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4117 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 10 0: 

CCGTGGAAAA GTCTGGATAG TGAATGGTCT T C AC AC AATG ACCTGAAAGA AGCCTGAGAA 60 

TAATTATGGA GAGTAGCATT CTGAGAGGTG TTAGCAGAAC CATATGACAG AGCTGTTTGA 120 

AGAGGGAATA TTGAGGAGAA AAATCCTGAG CCTACCAGTT GGAGTTGGAA AGAGCTGACT 180 

GTTAGATCAT GGTTTATTAT CCACAACCTG TGGATAACTT TGTGAATAAG AGAAGTTGCT 24 0 

AAAGAAGGAG ATATATAACG ATGAAGAAAA TCAAACCGCA TGGACCGTTA CCAAGTCAGA 300 

CTCAGCTAGC TTATCTGGGA GATGAACTAG CAGCTTTTAT CCACTTCGGT CCTAATACCT 3 60 

TTTATGACCA AGAATGGGGG ACTGGACAGG AGGATCCTGA GCGCTTTAAC CCGAGTCAGT 42 0 
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TGGATGCGCG TGAGTGGGTT CGTGTGCTCA AGGAAACGGG CTTCAAAAAG TTGATTTTGG 480 

TGGTCAAGCA CCACGATGGC TTTGTCCTTT AT C C G AC AG C TC AC AC AG AT TATTCGGTTA 540 

AGGTCAGTCC TTGGAGGAGA GGAAAGGGCG ACTTGCTCCT TGAAGTATCC CAAGCTGCCA 600 

CAGAGTTTGA T ATGG AT AT G GGGGTCTACC TGTCACCGTG GGATGCCCAT AGTCCCCTCT 660 

ATCATGTGGA CCGAGAAGCG GACTACAATG CCTATTATCT GGCTCAGTTG AAGGAAATCT 72 0 

TATCAAATCC TAACTATGGG AATGCTGGTA AGTTCGCTGA GGTTTGGATG GATGGTGCCA 780 

GAGGAGAGGG CGCGCAAAAG GTTAATTATG AATTTGAAAA ATGGTTTGAA ACCATTCGTG 840 

ACCTGCAGGG CGATTGCTTG ATTTTTTCAA CAGAAGGCAC CAGTATCCGC TGGATTGGCA 900 

ATGAACGAGG GTATGCAGGT GATCCACTGT GGCAAAAGGT GAATCCTGAT AAACTAGGAA 9 60 

CAGAAGCAGA GCTGAACTAT CTTCAGCACG GGGATCCCTC GGGCACGATT TTTTCAATCG 1020 

GAGAGGCAGA TGTTTCCATC CGTCCAGGCT GGTTCTACCA TGAGGATCAG GATCCTAAGT 1080 

CTCTCGAGGA GTTGGTCGAA ATCTACTTTC ACTCAGTAGG GCGAGGAACT CCACTCTTGC 114 0 

TTAATATTCC GCCGAATCAA GCTGGGCTCT TTGATGCAAA GGATATTGAA CGACTTTATG 1200 

AATTTGCGAC CTATCGCAAT GAGCTCTATA AAGAAGATTT GGCTCTGGGA GCTGAGGTAT 1260 

CTGGTCCAGC TCTTTCCGCA GACTTTGCTT GTCGCCATTT GACAGACGGC CTTGAGACCA 1320 

GCTCTTGGGC AAGCGATGCA GACTTGCCCA TCCAGTTAGA ACTCGACTTA GGTTCTCCTA 13 80 

AAACTTTTGA TGTAATTGAG TTAAGAGAAG ATTTGAAGCT AGGGCAACGA ATCGCTGCTT 1440 

TTCATGTGCA AGTAGAGGTG GATGGTGTCT GGCAGGAGTT TGGTTCGGGT CATACTGTTG 1500 

GTTACAAACG TCTCTTACGA GGAGCAGTTG TTGAGGCACA GAAGATACGT GTAGTCATTA 15 60 

CAGAATCACA GGCTTTGCCT TTGTTGACCA AGATTTCCCT TTATAAAACT CCTGGATTAT 1620 

CAAAAAAAGA AGTTGTTCAG GAACTAGCAT TTGCAGAAAA AAGCCTAGCT GTGGCAAAGG 1680 

GAGAAAATGC CTATTTTACA GTTAAGCGCA GAGAATGTAG TGGTCCTTTA GAAGCTAAGA 174 0 

T T T CG ATT C A AC CGGGG AC A GGTGTCCATG GTGTCGCCTA TCAGGATGAG ATTCAAGTCC 1800 

TTGCGTTTCA AACTGGTGAG ACTGAAAAAA GTCTGACGCT ACCAACCTTG TATTTCGCAG 1860 

GAGATAAAAC CTTGGATTTC TAT C TGAACC TAACGGTGGA TGGTCAGCTT GTGGATCAAC 1920 

TTCAAGTCCA AGTTTCATAA AAGAAGAACC TTTGCGCGAT GCAAAGGTTC TTTTGGTTAT 19 80 

TAGTGACTTG GTAACCAGCT GAGGGTGAAA GTTAGTTGTT CAGCTTTTAA GAGGTCTTGG 204 0 

TGTTGAATAG T TG AT AC GAG TGTTTTGTCC AGTCGGCATT CT TTG AC AAA GTTAAAATGG 2100 

TTGTGGTTTT GTTTAGTATG G AT ATCC AG C CATTTATCTT CTTTAGCGAG GTAGACTCGT 2160 
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AGATGGTCAA AGAGAGGGAT TCCGAGGTCA TAGCTTGGTT TTCCTGGACA GGTTGGATAA 2 220 

AATCCGAGAG CTG AC C AG AT GTACCAAGCA GAGAGACTAC CATTGTCTTC ATCTCCAGGA 2 2 80 

TAGGCTTCCC AACTTGGGTG AAAAGCTTTC TGACGGAGCG TCTTGATAAG AAGGGCAGTG 2340 

TAGTCAGGGT AATCGCTGTA AC GG AAGAG A TAAGGAATGT GGAAACTAGG CTGGTTGGAA 24 00 

ATGGCTATTT GTCCAAAAGG AGCAGTAGCC ATCTCGCTCA TTTCGTGAAT TTCGTAACCA 2 4 60 

TAGCCTGTTG TTTCAAAGAG GGGAGCATCT TGACAGGCTT T C AAAAG AT A GTTGCTAAAG 252 0 

GTTTCTTTTC CACCCATCAG TTGGATTAAG CCAGGGATGT CGTGGAGAAC GCCTAAAGTA 2 580 

GCTTGAATGG CAGAGCATTC AGCGTAGTCT CGCCCCCAAC TATAAGGAGA GAAGTCAGGG 2 640 

TGAAAGTTTC CTTGATTGTC TCGTGCTCGC ATGTAACCTG TCTCAGCGTC AAATAGCTGG 2 7 00 

CGGTAATTTT GTGAAGCAGC CTTGTAGGTT TCAGCGATTT CTATGTTCTC TAGTTTTTTG 27 60 

GCACAGCTGG CGATACAAAA GTCACTATAG GCATAGTCTA GAGTATGGCT AACACTTTCG 2 82 0 

TGGTGGTCGG TAGAGAGGTA ACCTAGTTCT TGGTATTGGG CTAGTCCGTG GCGGCCATTG 2880 

ATGCCGAGAG GGTCGGCTTT GCTGGCTGTT TCGAGCATGG CTTGGAAGAG TTCTCCTTCT 2 94 0 

AGGTCGGGGG TCATGTCCTT GCAGGCGCTA TCTGCGATAA TACCGTCTAA AAGTGTACCT 3 000 

GGC AT CAT AC CCCGTTCATC TGGAGCCAGC CATTTTGGAA GGAAACCAGT ATCGCGGTAG 3 060 

CTATTGAGGA AACCTTCTAA AAAGCGTTGA TAGTGCTCCG GTATGATAAG GGCAAAGAGG 312 0 

GGGAAGGTGG TGCGGAAGGT AT C C C AG AAA CCATTGTTGC TAAAGAGGAC ACCAGGCTTG 3180 

ACAGTACCAG TAG C C AG AT C CATGTGGATG GCTTGCCCTG ATTCATTAAT CTCATAAAAA 324 0 

GTCTGTGGGA AGAGGAAGAG TCTGTAGAGG CAGTGGTCAA AGAAGGTTCG GTCAGCCTCT 3 3 00 

CCTGTCTCTA T AATGT C AAA ACGATGGAGG AGATTTTCCC AATCCACTTG GGCACTTGAT 33 60 

TTACAGCTAT CAAAATCTTC TTGAGGTAGA TTGATTAGAG CTTGAGAAGG AGAGATGAAA 3420 

GAAGTGGCTA GTTGCATCTC GGTTTGACTA CTTGCTAAGT CAATTCGCCA GTCTCCAGCT 34 80 

TCTTGGCTGA TAGCAAGAAT ATCCGTGTTC ATTTGCAGGG CAGTGAACAT CGTTAGCGAA 3 540 

TTTTTGTTAG TTTCAGTTTT ACCTTCTTGT CGCAGGGCAA GAGTCCGCTT ATCTACTTGC 3 600 

TCTACTGTCA GTTCATCTGC TGCGTGAAGA TAGAGGGAGA GGGCTTTGCC TTGCTTTTGA 3 660 

TTCAAACGAA T AG AAGC AC C ATAGCAAGTC GGTGTGAGCT GGGTTTCAAT CTGATAACGC 3720 

AGAGAAAAGA GCTTCAAATA GTGAGGCTGG AAGCAAGCTT TATCTATATC ATAAGAAGAC 37 80 

TGGCGGTGAA AGAGGCTGTC TCCCCCCAGT TGACTGGTGA CAGGTGTCAG AAGGAGCCAA 3 840 

GAGTAGTCCC CAATCCAAGG ACTGGGCTGG TGAGTTAATC GAATCCCCTG AAAGATAGGC 3 900 

AGATGTGGAT CAAAAAACCA AG AT CCAT C C TGGTCACTGG TCTGGGGCAC AAAGTAATTC 3 960 
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ATCCCAAAAG GCACGCCTGT GTATGGCAGG GTATTTCCCC GAGAAAAGGC ATGCTTGTTG 402 0 

GTAGTTCCAA AACGGGTATC GATGGTATCA AGTAGTGGTT TCATAGTCTT TCCTTTAGCT 4080 

GTTTTTCTAC ATT AT AT C AG TAATAGAGGG CCTTTAG 4117 



(2) INFORMATION FOR SEQ ID NO: 101: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2727 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CTGGTTCAAT TATTATTCAC TCTAAGTAGT CATATGTTCT TTATTTATGT GAGTTTTTAC 60 

CTTTTAAAGG ATCTTGTTAG ATGGGAGAAG GTTTTAAAAG TGACAGATGA TAATACAAGA 12 0 

AAAGTTCGTT TATTAGTAGC CTTTTTTAGC AT T GT C AT AG GCTACATCCT GAGTTCTTTC 180 

T T T ATT AG C C TGTATCATTT GTGGCAAGAA GCGCTTAGAG GATTATTATG AAATCAAGAG 240 

TAAAGGAAAC GAGTATGGAT AAAATTGTGG TTCAAGGTGG CGATAATCGT CTGGTAGGAA 3 00 

GCGTGACGAT CGAGGGAGCA AAAAATGCAG TCTTACCCTT GTTGGCAGCG ACTATTCTAG 3 60 

CAAGTGAAGG AAAGACCGTC TTGCAGAATG TTCCGATTTT GTCGGATGTC TTTATTATGA 42 0 

ATCAGGTAGT TGGTGGTTTG AATGCCAAGG TTGACTTTGA TGAGGAAGCT CATCTTGTCA 4 80 

AGGTGGATGC TACTGGCGAC ATCACTGAGG AAGCCCCTTA CAAGTATGTC AGCAAGATGC 540 

GCGCCTCCAT CGTTGTATTA GGGCCAATCC TTGCCCGTGT GGGTCATGCC AAGGTATCCA 600 

TGCCAGGTGG TTGTACGATT GGTAGCCGTC CTATTGATCT TCATTTGAAA GGTCTGGAAG 6 60 

CTATGGGGGT TAAGATTAGT CAGACAGCTG GT T AC AT C G A AGCCAAGGCA GAACGCTTGC 72 0 

ATGGTGCTCA TATCTATATG GACTTTCCAA GTGTTGGTGC AACGCAGAAC TTGATGATGG 7 80 

C AG CG ACT CT GGCTGATGGG GTGACAGTGA TTGAGAATGC TGCGCGTGAG CCTGAGATTG 84 0 

TTGACTTAGC CATTCTCCTT AATGAAATGG GAGCCAAGGT CAAAGGTGCT GGTACAGAGA 900 

C TAT AAC C AT TACTGGTGTT GAGAAACTTC ATGGTACGAC T C AC AATGT A GTCCAAGACC 9 60 

GTATCGAAGC AGGAACCTTT ATGGTAGCTG CTGCCATGAC TGGTGGTGAT GTCTTGATTC 102 0 

GAGACGCTGT CTGGGAGCAC AACCGTCCCT TGATTGCCAA GTTACTTGAA ATGGGTGTTG 1080 

AAGTAATTGA AGAAGACGAA GGAATTCGTG TTCGTTCTCA AC T AG AAAAT CTAAAAGCTG 1140 

TTCATGTGAA AACCTTGCCC CACCCAGGAT TTCCAACAGA TATGCAGGCT CAATTTACAG 12 00 
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CCTTGATGAC AGTTGCAAAA GGCGAATCAA CCATGGTGGA GACAGTTTTC GAAAATCGTT 12 60 

TCCAACACCT AGAAGAGATG CGCCGCATGG GCTTGCATTC TG AG AT TAT C CGTGATACAG 1320 

CTCGTATTGT TGGTGGACAG CCTTTGCAGG GAGCAGAAGT TCTTTCAACT GACCTTCGTG 13 8 0 

CCAGTGCGGC CTTGATTTTG ACAGGTTTGG TAGCACAGGG AG AAACTG TG GTCGGTAAAT 1440 

TGGTTCACTT GGATAGAGGT TACTACGGTT TCCATGAGAA GTTGGCGCAG CTAGGTGCTA 1500 

AGATTCAGCG GATTGAGGCA AGTGATGAAG ATGAATAAGA AATCAAGCTA CGTAGTCAAG 15 60 

CGTTTACTTT TAGTCATCAT AGTACTGATT TTAGGTACTC TGGCTCTAGG AATCGGTTTA 162 0 

ATGGTAGGTT ATGGAATCTT GGGCAAGGGT CAAGATCCAT GGGCTATCCT GTCTCCAGCA 168 0 

AAATGGCAGG AATTGATTCA TAAATTTACA GGAAATTAGG CTGGAGAACC AGCCTTTTTC 1740 

TAAAGATAAG GAGAAATATG AACAAAAAAA CAAGACAGAC ACTAATCGGA CTGCTAGTGT 1800 

TATTGCTTTT GTCTACAGGG AGCTATTATA TCAAGCAGAT GCCGTCGGCA CCTAATAGTC 1860 

CCAAAACCAA TCTTAGTCAG AAAAAACAAG CGTCTGAAGC TCCTAGTCAA GCATTGGCAG 192 0 

AGAGTGTCTT AACAGACGCA GTCAAGAGTC AAATAAAGGG GAGTCTGGAG TGGAATGGCT 1980 

CAGGTGCTTT TATCGTCAAT GGTAATAAAA CAAATCTAGA TGCCAAGGTT TCAAGTAAGC 204 0 

CCTACGCTGA CAATAAAACA AAGACAGTGG GCAAGGAAAC TGTTCCAACC GTAGCTAATG 2100 

CCCTCTTGTC TAAGGCCACT CGTCAGTACA AGAATCGTAA AGAAACTGGG AATGGTTCAA 2160 

CTTCTTGGAC TCCTCCAGGT TGGCATCAGG TCAAGAATCT AAAGGGCTCT TATACCCATG 2220 

CAGTCGATAG AGGTCATTTG TTAGGCTATG CCTTAATCGG TGGTTTGGAT GGTTTTGATG 22 80 

CCTCAACAAG C AAT CCT AAA AACATTGCTG TTCAGACAGC CTGGGCAAAT CAGGCACAAG 2 340 

CCGAGTATTC GACTGGTCAA AACTACTATG AAAGCAAGGT GCGTAAAGCC TTGGACCAAA 24 00 

ACAAGCGTGT CCGTTACCGT GTAACCCTTT ACTACGCTTC AAACGAGGAT TTAGTTCCCT 24 60 

CAGCTTCACA GATTGAAGCC AAGTCTTCGG ATGGAGAATT GGAATTCAAT GTTCTAGTTC 2 520 

CCAATGTTCA AAAGGGACTT CAACTGGATT ACCGAACTGG AGAAGTAACT GTAACTCAGT 2580 

AAAAGATACG CCTACACTCC TATGTCACTT ATGGATGTAG GAGTTCTTTT T ACT AGT T T A 2 64 0 

AG C AGGACT A AGACAGGTAC TAAGACAAAA TAGCAACTTC TAAAACTAAC TTCCAGTTTT 2700 

GGGAGAGAGA TGGAAGTTAC TTTGAGA 2 727 
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
TTTTTTGTAG ATTTAAGTGG GGTGCAATTC CTAAAAAATA AAAAACAATT TTTGAAAATT 60 
ATGTTAGCAG GAATTGCTTC AAATTCGATT TTATCACTTA CAGGTTTACT TGTTTTATTG 120 
TTCACATCGT ATAAATTGCT TGGACTCTTA TTTTTTATCA TTAACTTAGG TATGATTTTT 180 
ATTAATTCAA TTCCTTTTTT TCAGTATGAT AGTGGTATTA TTTTAAGATA CTTGAATTCT 24 0 

AACAATAATA ACTTGAATTT T C AAT AT AT A GTTCAACTTT TAATAGCATT TGTTATTATT 300 
TATTTTCCTT TGAGTCAACT ATTACAGTTT TTGACACCCA AT AT T AT TGT TCGTAGTATA 3 60 

GGAGGGGTGG TTGTTTCTAT ACTGCTTTCT ATATTATATA TGATAGGAAG GACGAAATAT 42 0 

GTTCTACGTA AATAGTTATG TTTTTGCTTA TAAAAAAGAA GGTATAATGT ATTTACGTGG 480 
TCGGAGTATG CGGGAAATAG CTATAGAACC TCAAATTTCG CAAGAATTTA TCAACGATCT 540 
ATTTAATAGT TGTAAGGAAC TATTAGAGAT AGAAGAAGTA TTAGGCAGTA AACTAACATT 6 00 

TGAACTATAA ATGAACAAAT TTTAATTTCG GATGAGATAG ATATTGATAG TAG AT ATT C T 6 60 

AGAACTAAAG GTTACTATTC GTTATTTTAT AATGAAGAGT ATAATAAAAT ACAGAATAAA 72 0 

ACAGTATTAG T ATT AGG AG C AGGAGTCTTA GGATGTTATA TATCTCTAAG TCTAAGTATG 780 
TATGGAGTGA GGAAACTTAT TGTCGCTGAT TACGATATAA T AG AAC CAT C AAATTTAAAT 840 
AGGCAAATTC TTTATACAGA GTCGGATGTT GGTAAGGAGA AG AT TAATG T TCTTTCTGAA 900 
AAAATACACA AGTATAATTC AGATGTTCAG GTAGTACCTA TTTCTATTAA AGTTTCTTCA 9 60 

GTAGAAGAAT TAGAAAAAAT TGTTGCGGAA TATGGGAGTA TAGATTTTAT CGTTAAAGCA 1020 

ATTGATACGC CCATTGATAT TATAAAAATT GTCAATCAAT TTGCTGTATC G CAT A AG AT A 108 0 

TCCTACATAT CAGGAGGGTT TAATGGATGC TATCTTATTA TTGATAATAT AT AT AT CCC T 1140 

ACCATCGGTT CTTGCTTTGG TTGTCGGAAT ATAAACAAAG ATATAAATAA GTACACTTTA 1200 

TCTGATAAGA CAAAGTGGCC G ACT AC AC C A GAGATGCCTG CTATTTTGGG AGGGATAATG 12 60 

ACTAATTTAA TAATTAAAAT ATTTCTGGGA TGTTATAATG AAATCCTAAT AG AT AACG CT 13 20 

TACGTTTATA ATATGAGAAA TCATGCTCTA AGTCAAGAAA AATATGTTCT GGAAAACGGA 1380 

GAATGTCCAA TTTGTAAAAA AATAATAAAG TGAAAGATAA CAATATTAGA GCGAAAACAT 1440 

TTATTCGTTC AGTTTGTTTT TGCTTATTAT CAGGAGGAGT AGCTTTTTTA TCTGCTATTG 1500 

GGCAGTTCAC TGTTATAGAA ACACAATTAA TAGTATTGTT CTTGGGTATT ATTTTTGCTA 1560 

TATATTATGC TTACTACAAT AAAAATATTC AAACATCATT GGAAAATATA GTATGGCTTT 162 0 
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TTTCATCGTT TGAGATTTTA TTTTTGCTTG TTAATTTTAG AACATTTATT CAGTTACCAG 1680 

TGGATATTTT TATTGGTATG ATAATATTTT TAATGCTGTG G AT ATT T ATT ATGTTAGGTA 1740 

TAGTGTGTCT TAGTTATTAT ATAACTTTAT TATTTAGCAA GGAGGCTTAG TATGTTTAAA 1800 

AAAATAGGTA T AAT GAG CAT TTGCATATAT AT AATT AT T T TATACTGCTT GAGAATGTAT 1860 

C GT ATT AT C A ATAATATTGA AACAATCTTG CTAACGGTTA TATGCTTAAT GTTATTGTTT 192 0 

TTTTTAAGAC GTTTATTTGA TAAAGATAAG TAAATAGATG TTAAGTAAAA ATGTAGAATA 1980 

TAAAGGAGGT GCAATGAGTA TGATTGAAGT TAGCCATTTA TCAAAAAGTT TTGGTGATAA 204 0 

AATAGCTTTA AATAATATAA GCTTCACTGT TAAAGAAGGT TAGATTTTTG GATTTTTAGA 2100 

ACCATCTGGT TCTGGAAAGA CCACAACGAT T AAT AT T C T G ACTGGGCAGT TCCTTGCCGA 2160 

TAAAGGACAA TCTATTATTT TGGGACAAAA ATCTCAAAAT TTAACAAGCG GTG AAT T AAA 2 2 20 

GAGAATTGGA TTGGTTAGCG ATACAAGTGG ATTTTATGAG AAAATGTCTC TGTATAACAA 2280 

TCTTCTTTTT TATAGTAAAT TTTATAATAT TAGTAAATCA CGTGTTGATA ATTTGTTAAA 2 34 0 

GCGAGTAGGA TTATATGATA GTCGCAAGAT GGTAGCAGGA AAATTATCCA CTGGAATGAG 24 00 

GCAACGAATG CTTTTAGCAC GAGCTCTTAT CAACAACCCC GCTGTACTCT TTCTGGATGA 24 6 0 

ACCGACCTCA GGTCTAGATC CCACAACTTC TCGAACAATT CATGAGTTAA TTT TAG AATT 252 0 

GAAAACAGCA GGGACAACGA TTTTTCTAAC GACTCATGAT ATGAATGAAG CAACTCTTTT 25 80 

ATGTGATTAT GTTGCCTTAT TAAATAAAGG GAAATTAGTT GAGCAAGGAG CTCCTTCTGA 2 640 

ACTCATTCAA AGATATAATA AAGATAAAAA GATTAAGGTT ACAGATTATA ATGGGAATCA 2 7 00 

GATAACTTTT GATTTTACAT CACTAGAACA GGTATCTCAG ACTGATCTGG AAAATATTTT 27 60 

TTCAATTCAT TCATGTGAGC CT AC TTT AG A AGATATTTTT ATCACATTAA CAGGAGGAAA 2 82 0 

GCTAAATGCT TAAACGGTTT CTGGCTTTGG TATGGTTGCG TTGTCAAATC ATCCTTTCCA 2880 

ATAAGAGTAT TTTATTGCAA GTTTTAGTGC CTTTTGCTTT CACATATTTT T AT AAAT AT C 2 94 0 

TTATGGAAAC ACAGGGGAAG GTCAACGATC AACAGGCATT AGTTCTTTTG ATGATGTGTT 3000 

TACCTTTTTC TTTTTCTTTG GCTGTTGGAA GTCCTATAAC TATTATCTTG TCTGAAGAAA 30 60 

AAGAAAAGTA C AATT T AC AA ACTCTTCTGT TGAGTGGTGT TAAAGGCTCC GAATACATTT 3120 

TATCAACTAT GTTTCTTCCT TTTTTGCTAA CTTTTGTGAT TATGGGAACT ACTCCTCTTA 3180 

TTTTAGGAGT TACAATTGTA CATACTTTTA ATTATATTAC AATCGTTCTT CTAACCTCTT 3240 

TATCCATCAT TTTATTCTAT TT AT TGAT AG GTTTAACCGC GAAGAGCCAA GTAGTAGCTC 3 3 00 

AGGTTATCAG TCTTCCTGCT ATGATTTTAG TTGCTTTCTT ACCGATGCTA TCTGGTTTGG 3 3 60 

ATAAGACAGT TGCGAAGATA ACAGATTATA GTTTTATGGG ACTATTTACT AAGTTTTTCA 3 42 0 
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ACTCGTACCC TAATTGAAAA TGGTTTTGTG GAAGCTATCA AAGAAGCCTT TCCTGAAGTA 52 2 0 

GAAGTGATTG CAGGAACTGC AACAGCAGGG ATTCCACACG GAGCCATTAT TGCTGATAAG 52 80 

ATGGACTTGC CTTTTGCCTA CATCCGTAGT AAACCAAAAG AC C ACGG AG C TGGTAATCAA 5340 

ATCGAAGGTC GCGTAGCTCA AGGTCAAAAA ATGGTAGTGG TTGAAGACCT TATTTCAACG 5400 

GGTGGTTCAG TTCTTGAAGC TGTAGCAGCA GCCAAGCGAG AAGG AG C AG A TGTACTTGGA 54 60 

GTTGTAGCGA TTTTCAGCTA CCAATTGCCA AAAGCAGATA AGAACTTTGC AGATGCTGGT 5520 

GTTAAACTTG TGACGCTTTC AAACTATAGC GAGCTTATCC ATCTAGCCCA AGAAGAAGGT 558 0 

TACATCACGC CAGAGGGCCT TGATCTTCTA AAACGCTTTA AAGAAGACCA AGAAAATTGG 5640 

CAAGAAGGTT AGGTCAGTAA GATAAAGAGA GACGAGGCTA CCGAGTCTCT TTTACCATTT 57 00 

TATTTAAAAT ATGACAG 5717 



(2) INFORMATION FOR SEQ ID NO : 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

CCTGGACTTT CTAAAATGAA AT CTTGC G AC CTGGATCAAG CCCTTCATGA GCATTTTTCA 60 

GAAGAAGAAT TAGCTGGTCA CTTTCATGTC CTTCTATGGA CTTTTTTTAC AATGGCATTG 12 0 

CTATCACACC CAATACCTAT CTAAGCGCCT GGT T CGT AAA CTTTATTGCA GCTCTTCCTC 180 

TAAATTTCCT AATTGTTGAA CCAATTGCCC GTTTTATACT AAGTTCTTTT CAGAAACCAT 24 0 

TTACTGGGGA AGAAGTTGAA GATTTTCAAG ATGATGATGA AATCCCAACT ATTATCTAAG 3 00 

CCAGTTCTGT AAACTACTAA TATTTGAAAT CCACTTCCTT TTAGGGTGCA AT GGTT AT AA 3 60 

ATGAATTTTT GAGAGGATCA GAATGAAAAA ACTAGCAACC CTTCTTTTAC TGTCTACTGT 42 0 

AGCCCTAGCT GGGTGTAGCA GCGTCCAACG CAGTCTGCGT GGTGATGATT ATGTTGATTC 480 

CAGTCTTGCT GCTGAAGAAA GTTCCAAAGT AGCTGCCCAA TCTGCCAAGG AGTTAAACGA 54 0 

TGCTTTAACA AACGAAAACG CCAATTTCCC ACAACTATCT AAGGAAGTTG CTGAAGATGA 600 

AGCCGAAGTG ATTTTCCACA CAAGC C AAGG TGATATTCGC ATTAAACTCT TCCCTAAACT 660 

CGCTCCTCTA GCGGTTGAAA ATTTCCTCAC TCACGCCAAA GAAGGCTACT ATAACGGTAT 72 0 

TACCTTCCAC CGTGTCATCG ATGGCTTTAT GGTCCAAACT GGAGATCCAA AAGGGGACGG 780 

TACAGGTGGT CAGTCCATCT GGCATGACAA GGATAAGACT AAAGACAAAG GAACTGGTTT 840 
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CAAGAACGAG ATTACTCCTT AT TTGT AT AA CATCCGTGGT GCTCTTGCTA TGGCTAATAC 900 
TGGTCAACCA AACACCAATG GCAGCCAGTT CTTCATCAAC CAAAACTCTA C AG AT AC CT C 9 60 

TTCTAAACTC CCTACAAGCA AGTATCCACA GAAAATTATT GAAGCCTACA AAGAAGGTGG 102 0 

AAACCCTAGT CTAGATGGCA AACACCCAGT CTTTGGTCAA GTGATTGACG GTATGGATGT 10 80 

TGTGGATAAG ATTGCTAAGG CCGAAAAAGA TGAAAAAGAC AAGCCAACTA CTGCTATCAC 1140 

AATCGACAGC ATCGAAGTGG TGAAAG AC T A CGATTTTAAA TCTTAAAAAC CAAAAAAATA 12 00 

CAGTATCCAC ATTCGGTACT GTATTTCTTT TACTCTCATT CTTAAGTTAA AT TAT T AAAA 12 60 

TCCCATATTT GGTCTATCCA GCCTTCATAA AAGTCTGGCT CGTGGCAGAC CATAAGGATA 1320 

GATCCCCTAT ATTCTTTGAG AGCGCGTTTG AGCTCATCCT TTGCATCCAC ATCCAAATGG 13 80 

TTGGTCGGCT CGTCCAGCAC TAAAACGTTG TTTTCACGAT T CAT C AAG AG ACAGAAACGA 1440 

ACCTTGGCTT GCTCTCCCCC TGATAATACT TGAATCTGGC TTTCAATATG TTTGGTTGTC 15 00 

AAACCACAAC GGGCAAGGGC TGCACGGACT TCTGCTTGAT TAAGGGCAGG AAAGGCATTC 15 60 

CAGACAGCTT CAAGAGGAGT TTGGCGATTA CCGCCTTCTA CTTCCTGCTC AAAATAACCA 1620 

AGTTCTAAAT AATCTCCACG CTCCACTTCC CCAGCGATTG GCGAGATAAT GCCCAAGAGA 16 80 

CTCTTCAAGA GAGTTGTTTT TCCAATACCA TTAGCACCAA TAATCGCAAC CTTTTGATTG 1740 

CGTTCGAAGG TAAGATTTAA AGGCTTAGTA AG AGG AC GGT CGTAACCAAT TTGCAAGTTC 1800 

TTGGCTTGGA AG AT AAAG CG CCCTGGTGTA CGAGCTGGTT TGAAATCAAA GGATGGTTTT 18 6 0 

GGTTTCTCAC TTTGGAGTTC GATAATATCC ATCTTATCCA ATTTCTTTTG ACGAGACATA 192 0 

GCCATATTAC GAGTTGCAAC ACGGGCTTTA TTACGAGCCA CAAAGTCCTT GAGGTCTGCA 19 80 

ATCTCTTTCT GCTGGCGTTC GTAGGCTGCC TCTAGCTGAG ATTTCTTCAT AGCATAAACT 2040 

TCTTGGAACT GGTAGTAGTC ACCAGAGTAA CGCGTCAGCT GTTGATTTTC CACATGATAG 2100 

ACAATATTAA TAACGTCATT GAGGAATGGA ATATCGTGCG AAATGAGAAC AAAGGCATTC 2160 

TCATAGTTTT GGAGAT AG CG CTTGAGCCAA TCAATATGCT CAGCATCCAA GTAGTTGGTC 2220 

GGCTCGTCCA ACAGCAAGAT ATCAGGCTTT TCAAGGAGAA GTTTTGCCAA AAGCACCTTG 22 80 

GTTCTTTGCC CACCTGACAA AG AAG TT AC A TCCGTATCCA TGCCAAAGTC CATAACACCA 2 340 

AGAGCACGCG CTACTTCGTC AATCTTAGCA TCCAAGGTAT AGAAATCACG ACTCTCCAGA 24 00 

CGGTCTTGAA GTTCTCCTAC TTCTTCCATG AGAGCATCAA CATCCGCGCC GTCTTCAGCC 24 60 

ATTTTCATAT AGAGGTCATT GATACGAGCT TCAGCTTTGA AAAG CT CAT C AAAAGCCGTA 252 0 

CGGAGAACAT CACGCACCGA CTGTCTTTCA GCAAGGACAG AGTGCTGATC CAAGTAACCA 2 580 
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GCCGTCACAT ATTTGGACCA CTCAACCTTT CCTTCATCTG GCAGCATTTT ACCAGTCACG 2 64 0 

ATACTCATAA AGGTTGATTT TCCTTCACCA TTGGCACCGA CCAGGCCGAT ATGTTCTCCC 2700 

T T G AGG AG AC GGAAGGACAC ATCTTCAAAA ATTGCACGGT CACCAAAACC GTGACTCAGA 27 60 

TTTTTAACTT CTAAAATACT CATTTTAATT CCTTACCTTG TTTTTATGTA ATCGTTTATA 2 82 0 

AAGGAGCCAA GCCAGATAGC CACCCAAAGT GTTGGTCCAC AAAT CAT C AA TCTCAAAGAC 288 0 

GCGATTGAAA TCAAAGAAAA AGTCCAAGAT TAATTGCGTA CACTCGATTC CAAGACTCAC 294 0 

AAGAAAAC T A AAAAGAAGGA CCTTTTTTGT TTTCCGCAAA TTTGGAAATA GATAAAGGAG 3 000 

TTGGAAAATC AGAGGAAAAA ACAAGAAGAC ATTGAGGATA TTTTGTAAAA AAATCCAACA 3 060 

TAATTGTCCA ATGTCACTCA CTTCGCCCAG TTTCCAGAGA GAATTGAAAG GAGTCAAAAG 3120 

AAAAACCAGG CGTCCAAGAT GCTGAATACC TGGAGTTCCC ACTCCCACGG TAGATTGTTC 3180 

TTGAGGAGTA AAGCAAAAAC AGACAATGCA AATGCTATAG AAAATGACTC CCCAGACCAA 3240 

AATATGATTA TAAGTCTTCT T CATC ATT AA GGATTTACCG CTGCGACTGC CTTCTGGCGG 3 3 00 

TCACGTTTCA TTGTGTTAGA GCGCAATTGT CCACAAGCTG CGTCAATATC TGTACCATGC 3 3 60 

TCTTGACGAA CCACACAGTT GACCCCTTTT TTCTTAAGCG TAT CAT AG AA AGCCAACACG 342 0 

CACTCTTTGG GACTACGGCT ATATTGGTCA TGCTCACTAA CTGGGTTATA AGGAATCAAG 348 0 

TTTACATAAG ACAATTTCTT GATGTTCTTG AGCAATTCAG TCAATTCCAA GGCTTGTTCT 3 54 0 

ACACCGTCGT TGACTTCATT AAGCATGATA TATTCAAAGG TTACACGACG GTTTGTTGTC 3 600 

TCAATGTAGT ATTCAATAGC AGCAAAGAGT TTTTCAATCG G AAAGGC AC G GTTAATCTTC 3 660 

ATGATACTTG AACGAAGTTC ATTGTTAGGT GCGTGAAGAG ACACGGCAAG ATTGACCTGA 3 720 

ACCCCTTCAT CAGCAAAGTC ACGAATTTTA TGAGCCAAAC CTGAGGTTGA AACCGTGATG 37 80 

TGACGAGCAC CGATAGC C AT TCCTTTATCA TCATTGATAG TACGAAAGAA ATTCAAGACA 3 84 0 

TTGTTGTAAT TATCAAAGGG CTCACCGATT CCCATGACAA CGATATGGCT GATGCGTTCA 3 90 0 

TCCTGACCAC GCTCATCAAA GTATTTCTGA ACCAGCATGA TTTGCGCTAC GATTTCACCG 39 60 

TTATTGAGGT CACGTTGCTT CTTAATCAAA CCAGAGGCAC AGAAGGTACA ACCGATATTA 402 0 

CAGCCGACCT GAGTGGTCAC ACAGACAGAT AAACCATAGT GTTGACGCAT GAGTACAGTC 4080 

TCAATTAACA TACCGTCGGG CAATTCAAAG AGATATTTGA CTGT AC C AT C AGCAGACTCT 4140 

TGCACAATAC GTTGTTTCAA GGGATTGACC ACAAACTGGT CATTGAGCTT AG C AATC AAA 4200 

TCCTTGGAAA GGTTGGTCAT TTCTTCAAAT GAC TGC AC AC GTTTACGGTA GAGCCATTCC 42 60 

CAGATTTGAT CTGCACGGAA TTTCTTTTCT CCCTGCTCCA AT AC CC ATTC CTGCATGGTT 432 0 

TGATGTACCA AACTATGAAT TGAGGGTTTC ATTTCTTCTC CTTATTCTCT ACTCACTTCT 4380 
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GACGAATGAC AAAATGACGT TGTCCCTTGT CGTCTTTCTG ACGACGTCTA TTTTTCTTAT 444 0 

CTGCATTCGA CTTTCGTTTA GTTTGAGTCG GTTTCTTTCC TTTTCTAGAA GGTGTTTCTT 4 500 

CTTCCGTCTT ACGCATTTTC TTGTCAAATG ATGCTCGCTT AGGGGCTTCA TTTTCTAAGA 4560 

CAAAATAGGC ACAACCATAA CTACAATACT CTAAAAGGTA GTCTTGTAAA CGACTGATTT 4 62 0 

TTTCAAGTTT TTCTTCTGTT CGGTCATCCT TGTAAAAACC TCGTAGGCGA AGCTGTTCGT 4680 

TGCTCCAGTC CCCCACGATA TAATCAAACT TGGTTAATAC TTCTGAAAAA CGCTGATTAA 4740 

AAGTCGTCAC ATCAAAGGCA TCCTTGATAT TTTCAACCAA GGAAAAAGCT ATCCCTTCCG 4 800 

TTTCGACCTT GTCCCCGTGT AAATGGAACT CCGGACCAGG AAACTTGTTA TAGTTGTATA 4860 

ATTCAGGTGC AATTTCTTTT CG CAT AG AT A TCCTTTTTTC ACGATTACTT AATACTTTAT 4920 

TCTACCATAA TTTCTAGCAG TTAGCACGTT TCTCATAAAA ATGAAAAAAG TCTGACGATT 4980 

TTGTCAGACC AGAATCTTAT AACCTAAAAA GAGAAGAACA ATTCTTCCCT CCAACTATCA 504 0 

TTATTTAGCA GCTGCGTACA ATTCATCTAC TTTATTCCAG TTGATTACTG AAAAGAAAGC 5100 

TTTGATGTAG TCAGGACGCA CGTTGCGGTA TTTCACGTAG TAAGCATGTT CCCAAACGTC 5160 

CAAGCCCAAG ATTGGTTTTT TACCTTCTGA GATTGGTGTG TCTTGGTTTG CTGTTGAAGT 52 2 0 

CACTTCAAGT TTCCCTTCTT TGTTGACAAC CAACCATGCC CAACCTGAAC CAAAACGAGT 52 80 

TGTTGCTGCT GCAGTGAAGG CTGCTTGGAA TTCTTCAAAT GAACCAAATG TTGCATCGAT 5 34 0 

TGCTGCTGCC AGTTCTGCTG AAGGAGCTGT TTTCTCGGGA GTCATCAATT CCCAGAAAAG 54 00 

AGCGTGGTTC AAGTGTCCGC CACCATTGTT G AT AAGTGC T TGACGGATAT CAGCTGGGAT 54 60 

AGATTCTACA TCAGCAAGCA AGGCTTCAAG GTCTTCACCG ATTTCAGGGT GTTTTTCTAA 552 0 

AGCTGCATTG GCATTGTTGA CATAAGTTTG ATGGTGTT 555 8 
(2) INFORMATION FOR SEQ ID NO : 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

GGAATTGTAA ATATCATATT GTTTTTGCAC CCAAATATCG TCGTCAAATC ATTTATGGCA 60 

G AT AC AAAG C TAGTATCGGA AG AAT CAT AC GTGACTTATG TGAGCGTAAG GGTGTAATAA 120 

TCCATGAAGC GAATGCTTGT T C AG ACC AT A TTCACATGCT TATCAGTATT CCTCCGAAAC 18 0 
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TTAGTGTTTC GTCCTTTATG GGCTATTTAA AGGGCAAGAG CAGTTTGATG ATTTTTGATA 240 
AGCATGCGAA TTTAAAATAC AAATATGGCA ATCGCAAGTT TTGGTGTAGA GGCTATTATG 300 
TAGATACGGT AGGCCGTAAT CAGAAAGTGA TAGCTGAATA TATTCAGAAT CAATTACAAG 3 60 

AAGACAGAGT AGCAGACCAG CTCACGTTAT TCGAGTCAGT AGATCCGTTT ACTGGCGAAA 42 0 

TAAATAAGAG GAAGTAAC T A AGGTGCTTTA GCACCTGCTC GGGAAAGTGG TGCGCGAGGA 4 80 

AGCTATTTCG GTGGGCCTTT GGCCCTGGCC GGTAGAAGCG GCTTATAGCC GCAGAACAAA 540 
CCACCAGTTC ACACTGGTGG TTTTGATTTA AAAAACTTGA TACATAAAAA TAAAAGTCTA 600 
TATAAAGGAT GGTAAAATTC CTGTTGTCCG ATTTGGACAA TATCCTAAAT AGTTACAATA 6 60 

T ATGGTC T AT ACTTTTTCTT AGGAGAAAGC TAGATGTACA GACGTTTGAG AGATTTGAGG 72 0 

GAGGATCATG ATCTGCCCCA AAAGCAAATA GCTACAATAC TTTCGTTTAC AAATTCAGCT 7 80 

TATGCCAAAA TTGAACGGGG TGAGCATGCG TTGACGGCTG ATGTATTGGT TAAACTCTCA 84 0 

GATTTCTATG ACGTCAGTAC AGACTATTTA TTGGGATTAA CTGATTTTCC TGATAAAATT 9 00 

CGCTTTAGAA AATAATCTCC TCAATTTCAT AGAGTTTGAA AATGAGTGAG ATTTTTTATT 96 0 

TGCCCTTTGA CAACTGAATA GCCTAAAATG GTACTTTCCT CATTTGTGGA GCAAATTTGA 102 0 

ATGGCTCGCC ATGATAAGAG CGATTTTAAA ATCATCAATA AAATAGAGCG ATACTTTATA 1080 

TGCCATGATA CAAATGATAT ACAATGATAC TTCTGACCGT TCAGCCTGCC AACGTAAAAG 1140 

AGCAGCAAGT GAAATTCTTA TG AT G AC T TC ATCAGTCATG CCACGTTGAA TGTGTGAGTT 12 00 

TGTTAGATAA ACGCAATTAA TCCTCAAAAG GTTCCCCGAA CCTTTTGAGT T CT AC AG AC G 12 60 

CATCACGTGG AGTGTGTAAG CTTGTTGCTA AAAGCGTAAA AACCTTGGAA CGAAAGGAAT 1320 

AATAGACTTT CTGCGAAACA AAAATATAAT ACAATAAAAC TATGAATGAT GAAGCAAGTA 13 80 

AACAATTGAG CGATAGCCGT TTCAAGATCC TTGTAGGTGT TCAGCGCACG ACTTTTGAAG 1440 

AGATGTTAGC TGTGTTAAAA ACAGCTTATC AACGTAAACG CGCAAAAGGT GGACGAAAAA 1500 

GCAAATTAAG CCTAGACGAT CTCCTTATGG TAACTATTCA AT AC ATGC G A GAATAGAGCA 1560 

CTTATGAACA AATTGCGGCT GATTTTGGCA TTCACGAAAG CAACTTAATC CGTCGGAGTC 1620 

AATGGGTTGA AGCAACTCTT ATTCAAAATG GTTTTACGAT TTCAAATTCT GCCTTAATTC 1680 

TGTAAAAACA GTAAAATTCG AAGGATTGTA AGGTAAGAGT TTTTTTCTTT CTGAAAAAAT 1740 

GGTATAATAG CAATCAAAAC TAGAAAATAA AACGGAATTT GGAACAGATT TGTCTGTATC 1800 

CTAGTAGAGT GGTGATACTA TGAAGATTAG TAAGAGGCAC TTATTAAATT ATTCCATCTT 18 60 

GATTCCCTAC TTGCTTTTAT CTATTTTGGG CTTGATTGTG GTCTATTCGA CCACCAGTGC 192 0 

TATTTTAATT GAAGAAGGCA AGAGCGCCTT GCAGTTGGTT CGAAACCAAG GAATCTTTTG 19 80 
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GATTGTTAGT TTGATACTGA TTGCCTTAAT TTATAAATTG AGACTAGATT TTTTGAGAAA 2 040 

TGAGCGACTA ATCATTTTAG TTATATTAAT AGAAATGCTT TTATTGTTCT TGGCTCGTTT 2100 

TATTGGTATT TCCGTAAACG GGGCATACGG TTGGATTTCG GTTGCAGGAA TAACTATTCA 2160 

GCCAGCTGAG TACTTAAAAA T CAT T ATT AT TTGGTATTTA GCTCACCGAT TCTCCAAACA 2220 

GCAAGAAGAA ATAGCTACTT ATGATTTTCA AGTTTTGACT CAAAATCAAT GGCTTCCCCG 22 80 

TGCTTTTAAT GATTGGCGAT TCGTTCTCCT AGTTCTGATT GGAAGTTTGG GAATTTTCCC 2340 

TGATTTAGGA AATGCGACTA TTTTAGTCTT GGTTTCCTTG ATT ATG T AT A CAGTTAGTGG 2400 

AATCGCTTAT CGCTGGTTTT CAACCATTCT GGCGCTCGTA TCTGCCGCTT CTGTCTTTGT 24 60 

CTTGACCACT ATCAGCCTAA TCGGTGTTGA GACCTTTTCA AAAATTCCAG TATTCGGCTA 2 52 0 

TGTAGCCAAG CGCTTTAGTG CCTTTTTTAA TCCTTTTGCC GATCGTGCTG ATGCAGGTCA 2 580 

CCAGTTAGCT AATTCTTATT TTGCCATGGT CAATGGCGGT TGGTTTGGTC TAGGTCTTGG 2 640 

AAACTCGATT GAAAAACGAG GTTATTTGCC AGAAGCTCAT ACAGACTTTG TCTTTTCTAT 2 700 

CGTGATTGAA GAATTTGGCT TTGTTGGTGC CAGTCTTATT TTAGCTCTCT TGTTTTTCAT 2 7 60 

GATTTTGCGG ATTATCTTGG TCGGTATCCG AGCGGAGAAT CCTTTCAATG CCATGGTTGC 2 82 0 

ACTCGGTGTC GGAGGGATGA TGTTGGTTCA GGTATTTGTC AATATCGGAG GGATTTCGGG 2 880 

CTTGATTCCA TCTACAGGAG TGACTTTCCC CTTCTTATCC CAGGGTGGAA ATAGTCTTCT 2 94 0 

AGTCTTATCA GTGGCAGTAG CCTTTGTCTT AAATATTGAT GCCAGTGAAA AACGCGCTAA 3000 

ATTGTACCGA GAATTGGAAA ATCAACCAAT GAACCTTCTG TTGAAGTAGG ATAAAGAAAG 3 0 60 

GATAGTTTAT GTCTCTTCAA AAAT TAG AAA ATTATAGTAA TAAAAGTGTT GTGCAAGAAG 312 0 

AAGTCTTGAT TCTAACAGAA TTACTGGAAG ATATTACTAA AAATATGCTT GCCCCAGAGA 318 0 

CCTTTGAAAA AATAATACAG TTGAAAGAAT TATCAACGCA GGAAGATTAT CAAGGTCTAA 32 4 0 

ACCGTCTAGT GACTAGCTTA TCAAATGATG AAATGGTCTA TATTTCACGC TATTTCTCTA 3300 

TCTTGCCTCT TTTGATTAAT ATTTCAGAGG ATGTGGATTT AGCTTATGAA AT C AAT CATC 3 3 60 

AAAATAATAT TGATCAGGAC TATTTAGGTA AATTATCTAC AACGATTAAA TTGGTAGCAG 342 0 

AAAAGGAAAA TGCCGTTGAG ATCCTAGAAC AC T T G AAT GT TGTCCCTGTT TTGACAGCCC 3480 

AT CC AAC AC A AGTGCAACGC AAAAGTATGT TGGATTTAAC AAAT CAT ATT CATAGTCTTT 3 54 0 

TGCGTAAATA CCGTGATGTT AAGTTGGGGT TGATCAATAA AGATAAATGG TACAATGATT 3 600 

TGCGTCGTTA CATCGAAATT ATCATGCAGA CAGACATGAT TCGTGAGAAA AAATTAAAAG 3 6 60 

TGACTAACGA AATCACGAAT GCTATGGAAT AT TAT AAC AG CTCCTTTTTG AAAGCTGTAC 3 72 0 
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CTCATTTGAC GACGGAGTAT AAGCGCTTAG CGCAAGCGCA TGGTCTGAAT TTAAAACAGG 3 780 

CTAAACCAAT CACCATGGGT ATGTGGATAG GTGGTGACCG TGATGGAAAT CCATTTGTTA 3 84 0 

CAGCAAAGAC CTTGAAGCAG TCTGCACTCA CTCAGTGTGA AGTCATCATG AACTACTATG 3 900 

ATAAAAAGAT TTACCAACTT TATCGTGAAT TTTCTCTTTC AACTAGCATT GTCAACGTCA 3 960 

GCAAGCAAGT CAGAGAAATG GCTCGTCAAT CCAAGGATAA CTCGATTTAC CGCGAAAAAG 402 0 

AGCTTTACCG TCGTGCCTTG TTTGATATTC AATCAAAAAT TCAGGCAACT AAAACCTATC 4 080 

TG AT TG AG G A TGAAGAAGTT GGGACTCGTT ATGAAACCGC CAATGATTTC TACAAGGATT 4140 

TGATTGCCAT TCGAGATTCT CTACTAGAAA ATAAGGGCGA GTCCTTGATT TCAGGTGATT 4200 

TTGTGGAATT ATTGCAGGCA GTAGAGATAT TTGGTTTTTA CTTAGCATCA AT T G AT AT GC 42 60 

GACAAGACTC TAGCGTCTAT GAAGCCTGTG TGGCAGAACT CTTGAAATCA GCAGGAATTC 4 32 0 

ATTCTCGTTA TAGCGAGTTG AGCGAAGAAG AAAAGTGTGA CCTTCTCTTG AAAGAATTAG 43 8 0 

AAGAAGATCC CCGAATTCTT TCTGCGACTC ACGCAGAAAA AT C AG AATT A TTAGCAAAAG 4440 

AATTAGCTAT TTTTAAGACG GCTCGTGTTT TGAAAGATAA GTTGGGAGAT GATGTCATCC 4 500 

GTCAGACCAT CATTTCACAT GCAACCAGCC TTTCTGATAT GCTAGAATTA GCTATTCTGT 4560 

TAAAAGAAGT AGGACTGGTG GATACGGAAA GGGCGCGTGT TCAGATTGTT CCCCTTTTTG 4 62 0 

AAACAATTGA AGACTTGGAT CATTCAGAGG AAACAATGAG AAAATATCTT TCTCTTAGCC 4680 

TTGCCAAAAA ATGG AT TG AC TCACGAAATA ACTACCAAGA AATCATGCTT GGCTACTCTG 4 740 

ACAGTAATAA AGATGGCGGT TACTTGTCAT CATGTTGGAC CCTCTACAAG GCTCAACAAC 4 8 00 

AATTGACTGC TATTGGAGAT GAATTTGGCG TTAAGGTTAC CTTCTTCCAT GGTCGTGGTG 4 8 60 

GTACTGTCGG TCGTGGTGGT GGGCCAACCT ATGAAGCCAT TACATCTCAA CCGCTCAAGT 492 0 

CTATCAAGGA TCGTATCCGC TTGACGGAGC AGGGTGAAGT AATTGGGAAT AAATACGGTA 4 980 

ACAAAGACGC CGCTTACTAT AACCTTGAAA TGCTAGTATC GGCAGCTATT AACCGTATGA 504 0 

TTACTCAGAA GAAGAGCGAT ACCAATACCC CAAATCGTTA TGAAACCATT ATGGATCAAG 5100 

TAGTGGACCG TAGTTACGAT ATCTACCGTG ATTTGGTCTT TGGTAATGAG CATTTCTATG 5160 

ATTATTTCTT CGAGTCAAGT CCAATCAAGG CTATTTCAAG TTTTAATATT GGTTCTCGTC 52 2 0 

CAGCCGCTCG TAAGACTATT ACTGAAATCG GTGGTTTGCG TGCCATCCCT TGGGTATTCT 5280 

CATGGTCACA GAGTCGTGTT ATGTTCCCTG GATGGTACGG GGTTGGTTCA AGCTTCAAGG 53 4 0 

AATT TAT C AA T AAAAAT CCA GAG AAT AT TG CTATCTTACG AGATATGTAC CAAAATTGGC 5400 

CTTTCTTCCA ATCGCTTCTT TCAAATGTTG ATATGGTTTT GTCAAAATCA AATATGAATA 54 60 

TTGCTTTTGA ATATGCTAAA CTTTGTGAAG ACGAGCAAGT T AAGG C CATC TATGAGACTA 552 0 
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TTTTAAATGA ATGGCAAGTT ACTAAGAACG TTATCTTGGC TATTGAAGGA CATGACGAAC 5580 

TCTTAGCTGA CAATCCATAT CTAAAAGCTA GTCTGGATTA CCGTATGCCT TACTTTAATA 564 0 

TTCTCAACTA TATTCAGTTG GAGTTGATTA AACGCCAACG TCGTGGAGAA TTGTCCAGTG 57 00 

AT CAAG AACG ATTGATTCAT AT C AC C AT C A ACGGAATTGC GACAGGATTG CGTAATTCAG 57 60 

GTTGATAATT TTCAAGAGTG AATGCTAAAA GTGAATATCA AAAAAATTCT AATAGACTAT 582 0 

TGACAAGTAG TTTAAAAATG ATATAATTTA ACCATTCAGA AAAGTAATCA TACAAACTTT 5880 

TTAGAGAGTC TGTGGTAGCT GAAAACAGAT AAGTGGCAAT GATGAAAATT GGGCTGAATG 594 0 

CTATTTAGAA TTTGAAATTA TAAAAATTCG GTAAGCACAC CTTACAGTGC ATCTCGTTAT 6000 

TGCGAGACTG AGCGATAGGG AAATTCCCTA TAATTGAGGT GGTACCGCGC ATCGACGTCC 606 0 

TCACACAAGT TTTTTGTGTG AGGATTTTTT TGATGGAGGT TAGTATGGAA AGAAAACGAT 612 0 

GGCGTCGCTT GTTTAGATAA GTGAAATATG TTAAAGGAAA TAAAAAGGAG AAACAGAATG 6180 

AAAAATAAAC GTTTAATTGG AATTATTGCT GCATTAGCAG TCTTAGTAGC AGGAAGCTTG 62 40 

ATTTATTCTT CAATGAATAA ATCAGAAGCT CAGAATAATA AGGATGAGAA GAAAATAACC 63 0 0 

AAGATTGGTG TGCTTCAATT TGTGAGCCAT CCATCCCTTG ATTTGATTTA T AAAGGG AT C 636 0 

CAAGATGGAC TTGCAGAAGA AGGATATAAA GATGATCAAG TTAAAATTGA TTTTATGAAC 642 0 

TCAGAAGGTG ACCAAAGTAA GGTTGCGACA ATGAGTAAAC AATTGGTTGC AAATGGGAAT 6480 

GACCTTGTGG TTGGTATCGC AACACCAGCA GCCCAAGGGT TGGCTAGTGC AACAAAAGAC 654 0 

CTACCGGTTA TCATGGCCGC TATTACAGAC CCAATTGGTG CTAACTTGGT TAAAGATTTG 6600 

AAAAAACCAG GTGGCAACGT TACAGGGGTA TCTGACCACA ATCCAGCTCA ACAACAAGTT 6 660 

GAACTCATCA AGGCTCTGAC ACCGAATGTG AAAACAATCG GAGCTCTTTA CTCAAGTAGC 672 0 

GAAGACAATT CAAAA 6735 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

C T AG AGGATC CCAGCAGGTA AATTGGCTTC AGCTGGCAAA AAAGTTGCCC TCGTTGAACG 6 0 

CAGCAAGGCT ATGTACGGTG GAACTTGTAT CAACATTGGT TGTATCCCAA CTAAAACCTT 120 
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GCTAGTTGCT GCTGAAAAGG ACTTGTCTTT TGAAGAAGTC ATTGCTACTA AAAAC AC G AT 180 
CACTGGTCGC CTCAACGGTA AAAAC TATGC GACTGTTGCT GGTACAGGCG TAG AT AT C TT 240 
TGATGCGGAA GCTCACTTCC TTTCAAATAA AGTCATCGAA ATCCAAGCTG GTGATGAAAA 3 00 

GAAAGAACTG ACTGCTGAAA CAATCGTCAT CAACACTGGT GCTGTTTCAA ACGTCTTGCC 3 60 

AATCCCTGGA CTTGCTACAA GCAAAAACAT CTTTGACTCA AC AGGT AT C C AAAGCTTGGA 42 0 

CAAATTACCT GAAAAACTTG GAATCCTTGG TGGCGGAAAT ATCGGTCTTG AATTTGCCGG 480 
CCTTTACAAC AAACTTGGAA GCAAGGTCAC AGTCCTAGAT GCCTTGGATA CATTCCTACC 54 0 

TCGTGCAGAA CCTTCCATCG CAGCTCTTGC TAAACAATAC ATGGAAGAAG ATGGCATTGA 6 00 

ATTGCTTCAA AATATCCATA CTACTGAAAT CAAAAACGAT GGTGACCAAG TGCTTGTCGT 660 
AACTGAAGAC GAAACTTACC GTTTCGACGC CCTTCTCTAC GCAACTGGAC GCAAACCAAA 720 
TGTAGAACCA CTTCAACTTG AAAATACAGA TATTGAACTA ACTGAACGTG GTGCTATTAA 7 80 

AGTAGACAAA CACTGTCAAA CAAACGTTCC TGGTGTCTTT GCAGTTGGAG ATGTCAACGG 840 
TGGCCTTCAA TTT AC T T AC A TTTCACTTGA TGACTTCCGT GTTGTTTACA GCTACCTTGC 9 00 

TGGAGATGGC AGCTATACAC TTGAAGACCG TCTCAATGTG CCAAATACTA TG T T CAT C AC 9 60 

ACCTGCACTT TCACAAGTTG GTTTGACTGA AAGCCAAGCA GCTGATTTGA AACTTCCATA 1020 

CGCTGTTAAG GAAATCCCCG TTGCAGCAAT GCCTCGTGGT CACGTAAATG GAGACCTTCG 1080 

CGGTGCCTTC AAAGCTGTTG TCAATACTGA AACAAAAGAA ATTCTTGGAG CAAGCATCTT 114 0 

CTCAGAAGGT TCTCAAGAAA TCATCAACAT CATCACTGTT GCTATGGACA ACAAGATTCC 12 00 

TTACACTTAC T T C AC AAAAC AAATCTTCAC TCACCCAACC TTGGCTGAGA ACTTGAATGA 12 60 

CTTGTTTGCG ATTTAAGTTG AGATTTAATC GTATCGAACA GCCCTCTTTG GGCTGTTTTT 1320 

ACTTCTGCGG AATCTCAAAT CTGTCTTTCT CCTCTTTTAT GATATAATAG AAACATGAAC 13 8 0 

TTAAAAACTA CTTTGGGCCT TCTTGCTGGG CGTTCTTCCC ACTTCGTTTT AAGCCGTCTT 14 40 

GGACGTGGAA GTACGCTCCC AGGGAAAGTC GCCCTTCAAT TTGATAAAGA TATTTTACAA 1500 

AACCTAGCTA AGAACTACGA GATTGTCGTT GTCACTGGAA C AAAT GG AAA AACCCTGACA 1560 

ACTGCCCTCA CTGTCGGCAT T TT AAAAG AG GTTTATGGTC AAGTTCTAAC CAACCCAAGC 1620 

GGTGCCAACA TGATTACAGG GATTGCAACA ACCTTCCTAA CAGCCAAATC TTCTAAAACT 1680 

GGGAAAAATA TTGCCGTCCT CGAAATTGAC GAAGCCAGTC TATCTCGTAT CTGTGACTAT 1740 

ATCCAGCCTA GTCTTTTTGT CATTACTAAT ATCTTCCGTG AC C AG ATGG A CCGTTTCGGT 1800 

G AAAT CT AT A CTACCTATAA CATGATATTG GATGCCATTC GGAAAGTTCC AACTGCTACT 186 0 

GTTCTCCTTA ACGGAGACAG TCCACTTTTC TACAAGCCAA CTATTCCAAA CCCTATAGAG 192 0 
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TATTTTGGTT TTGACTTGGA AAAGGGACCA GCCCAACTGG CTCACTACAA TACCGAAGGG 19 80 

ATTCTCTGTC CTGACTGCCA AGGCATCCTC AAATATGAGC ATAATACCTA TGCAAACTTG 204 0 

GGTGCCTATA TCTGTGAAGG TTGTGGATGT AAACGTCCTG ATCTCGACTA TCGTTTGACA 2100 

AAACTGGTTG AGTTGACCAA CAATCGCTCT CGCTTTGTCA TAGACGGCCA AGAATACGGT 2160 

ATCCAAATCG GCGGGCTCTA TAATATCTAT AACGCCCTAG CTGCTGTGGC CATCGCCCGT 222 0 

TTCCTAGGTG CCGATTCGCA ACT CATC AAA CAGGGATTTG ACAAGAGCCG TGCTGTCTTT 2 2 80 

GGACGCCAAG AAACCTTTCA TATCGGTGAC AAGGAATGTA CCCTTGTCTT GATTAAAAAT 2 34 0 

CCAGTCGGTG CAACCCAAGC TATCGAAATG ATC AAAC TAG CACCTTATCC ATTTAGCCTA 2400 

TCTGTCCTCC TTAATGCCAA CTATGCAGAT GGAATTGACA CTAGCTGGAT CTGGGATGCA 2460 

GACTTTGAAC AAATCACTGA CATGGACATT CCTGAAATCA ACGCTGGCGG TGTTCGTCAT 2 520 

TCTGAAATCG CTCGTCGCCT CCGAGTGACT GGCTATCCAG CTGAGAAAAT C ACT G AAAC G 2580 

AGTAATCTGG AGCAAGTTCT CAAGACCATT GAGAATCAAG ACTGCAAGCA TGCCTATATT 2 64 0 

CTGGCAACTT ATACTGCCAT GCTGGAATTT CGTGAACTGC TGGCTAGTCG TCAGATTGTT 27 00 

AGAAAGGAGA TGAACTAATG GTTTATACTT CACTTTCCTC AAAAGATGGC AATTACCCCT 27 60 

ATC AG C TC AA CATTGCCCAC CTCTACGGAA ATCTCATGAA TACtACGGGG AC AAT GG AAA 2 82 0 

CATCCTCATG CTCAAGTATG TGGCTGAAAA ACTGGGAGCC CATGTGACCG TTGACATCGT 2880 

TTCTCTCCAT GAT G AC TT TG ATGAAAATCA CTACGACATC GCCTTTTTCG GTGGTGGTCA 2 94 0 

AGACTTTGAA CAAAGTATCA TTGCAGACGA CCTACCTGCT AAAAAAGAGA GCATTGACAA 3 000 

CTACATCCAA AACGACGGTG TAGTTCTGGC TATCTGCGGT GGTTTCCAAC TATTGGGTCA 3060 

ATATTATGTT GAAGCTTCAG GAAAACGTAT CGAAGGGCTA GGGGTCATGG G AC ACT AC AC 312 0 

GCTCAACCAG ACCAATAACC GTTTTATCGG TGACATCAAG ATTCACAATG AAGATTTCGA 3180 

TGAAACCTAC TATGGATTTG AAAATCACCA AGGTCGTACC TTCCTCTCTG ATGACCAAAA 3 24 0 

ACCGCTGGGA CAGGTTGTCT ATGGAAATGG AAACAACGAA GAAAAGGTCG GTGAAGGGGT 3300 

TCATTATAAG AATGTCTTTG GTTCCTACTT CCACGGGCCT ATCCTCTCTC GTAATGCCAA 33 60 

TCTGGCTTAT CGCCTAGTTA CTACTGCCCT CAAGAAGAAA TATGGTCAGG ACATCCAACT 3420 

CCCTGCCTAT G AGG AC AT TC TCAGCCAAGA AATCGCTGAA GAGTACAGTG AC GT C AAAAG 3480 

CAAGGCTGAC TTTTCTTAAA CAAAGGAAAA TG AT AT C AAA GAACTCCGTT ATCTTGTCGG 354 0 

AGTTTTTTGT CTTTTCTTTT ACCCTTCTCC CTTGCATTTT CTCTCATTTT TTGCCAAAAT 3 600 

AGAGGGGTAG AAAGAAGGTA GCATATGTCT AAATTACAAC AAATCCTAAC ATATCTTGAA 3 6 60 
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TCAGAAAAAC TAGACGTCGC TGTCGTATCT GACCCCGTCA CAATCAATTA CCTCACTGGT 3 720 

TTTTACAGTG ATCCCCATGA ACGCCAAATG TTCCTCTTTG TCCTAGCAGA TCAGGAACCT 37 80 

CTCCTCTTTG TCCCAGCTCT TGAAGTAGAA CGTGCAAGTA GCACCGTTTC CTTCCCAGTA 3 84 0 

GTGGGCTATG TCGATTCTGA AAATCCATGG CAAAAAATCA AACATGCTCT TCCACAACTT 3 900 

GACTTCAAAC GTGTCGCTGT TGAGTTTGAC AATCTCATCT TGACCAAATA CCATGGTTTG 3 9 60 

AAAACAGTTT TTGAGACTGC TGAGTTTGAC AACCTCACTC CTCGTATCCA ACGCATGCGC 4020 

CTCATCAAAT CAGCTGATGA AGTGCAAAAA ATGATGGTTG CAGGTCTTTA TGCTGACAAG 408 0 

GCTGTTCATG TTGGTTTTGA CAATATTTCT CTTGATAAGA CTGAGACAGA TATCATCGCA 4140 

CAAATCGACT TTGCCATGAA ACGTGAAGGT TATGAAATGA GCTTTGATAC CATGGTCTTG 420 0 

ACTGGTGATA ATGCTGCGAA TCCACACGGC ATTCCAGCAG CTAATAAGGT TGAAAATGAT 42 60 

GCTCTTCTCC TCTTTGACCT GGGTGTTCTG GTCAATGGCT ATGCGTCAGA TATGACTCGT 432 0 

ACAGTCGCTG TCGGCAAACC AGACCAATTC AAGAAAGATA TTTACAACTT GACTCTTGAA 43 8 0 

GCCCAACAAG CTGCTCTTGA CTTTATCAAG CCAGGTGTGA CTGCTCATGA AGTGGACCGC 444 0 

GCTGCCCGTG AGGTCATCGA AAAAGCTGGT TATGGTGAGT ACTTCAACCA CCGTCTCGGG 4500 

CATGGTATCG GTATGGATGT CCATGAATTC CCATCTATCA TGGAAGGAAA CGACATGGTC 456 0 

ATCGAAGAAG GCATGTGCTT CTCTGTTGAA CCAGGTATCT ATATCCCTGG TAAAGTCGGT 4 62 0 

GTTCGTATTG AAGACTGCGG TGTTGTTACC AAGGATGGCT TCAACCTCTT TACAAGCACC 4680 

AGCAAAGATT TGCTTTATTT TGATTAAACT ATATAGCCCC TATGCTTTCC TTTCAAAATA 474 0 

TCTAGGGGCT ATTTTATTGT CATTTTTCTG CTATTATGCT AAAGAAATTG GCTGCAATAA 4 8 00 

TCTAACCCTA AGTGTCTGGA AT G AT AAC G A GGGTGCTCTC CGCTTTTATC AAAGACAAGG 48 60 

GATGAAACCC CAAGAAACAA CAATGGAAAT GATAATTGAT TAAGAAGTCA TCTATCAAAA 492 0 

GATGTTAGAA AAAGTTCAAT TTCACTAGAA AATGAGGAAA ATCTCCCCAC AATAAAACGC 4 980 

AT AGT AT C AG GTATTGTGTA CTGACCCCAA ACAGTTAGAC AATTAATTTA TCCGAAGGAT 504 0 

TTAGTTCTGT ACTGCACAGG ACTAAGTCCT TTTAGTTTTA CCTTAATTCG TTTGTTGTTG 5100 

TAGTAATCAA TATAGTCTAT AATGACTTGT TCCAATTGGT TAAGTGATTT AAATGTTTTC 5160 

TCATAGCCAT AAAACATTTC GGATTTTAAA ATGCCAAAGA AAGATT C C AT CATACCGTTG 52 2 0 

TCTTGGCTGT TTCCCTTGCG TGACATAGAT GCTTGAATTC CCTTATTCTC TAGGAACCGA 52 8 0 

TGATAAGAAT CGTGTTGGTA TTGCCAGCCT TGGTCACTAT GGAGAATCGT ATTCTCGTAG 5340 

TGCTTCTCTT TGAATGCCTG TTCCAACATT GTTTGTACTT ATTCTAAATT AGGCGAACAA 5400 

GAAAGATTAA AAGCAATAAT TTCGCTGTTA AAGCCATCTA AAACTGGTGA TAAGTAAAGC 54 60 
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TTTTGAGTAC TTGCTGGAAT GGCAAATTCA GTCACATCTG TGTAGCACTT TTCCATTGTT 552 0 

TTAGAGCCTT CAAATTGGGC TTGAATGAGA TTCTCTGCCT TCTTACCAAC GTCTCCTTTA 5580 

TGAGAAGAAT ATTTTCGTTT CTTTCGCATT TTAGCTTGTA AATTGAGTAC TTTCATCAAG 564 0 

CCTTGAACTC TTTTATGATT T AC C AG AT AA CCACGATTTC TTAGTTCTAA ATGAACCCGG 57 00 

CGATAAGCAT AATTTCCCTT GTGTTCGATA AAGATGGATT GAATTTCAGT TTTAAGCTCT 57 60 

TGGTCTTTAT CTGTTTTGTC TAGCTGTTTC AAGTGATAGT AGTAGGT CCA ACGAGCTAGT 582 0 

TTAATGGCTT CTAGAAGAAG ATCTAACGAA AACTCAGTCA TTAATTCTTG AACAATTTCT 58 80 

GTCTTTCTTC TTTCTCTTTT TCCTCCTTCA ATCGGAGTTC TCTTAACTTT TTTAGGATGG 594 0 

CATTCTCCGC TCTCAGGTAC TCTCCCTCTT GTTTTCTCAA CAATAGTATA CCCGTTTTTC 6000 

CTGTATTGTG CTAGCCAGTT AAGAAGTATC GTACGACTTG GGAGACCGTA TTCAAGAGAA 606 0 

ACTCTATCTT TAGTCCAGCC TTCATGTCAG ACTTTATTAA CCCCAATTAT TCACCCCAAA 612 0 

TCTAAAAACC ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC 6180 

CAATGGGTGT TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA 624 0 

AAACACAAAA AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA 6 3 00 

GTCTTTTTAC CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT 6 3 60 

TTTTCAGGAA CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC 64 2 0 

GAATGAmCAA CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCwTGTCC AGTTCCTCTT 64 80 

TCAACTGTTA ACAGGTTATG GAACGGAATA TGCTTG 6516 
(2) INFORMATION FOR SEQ ID NO : 10 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 106: 

TTTTCAACCC ATATCGTGGC TCCTGAATAC TACTTACTGA CAACTATGCT AT C AG AG AC T 60 

TCTCTACTTG TTTTCTATAT CATTTTCATC CATAGAAAAC AACTCATCCA C TTGGG AC AT 12 0 

ATCTTTAGCT ATACTGTTCG ATACTCTCTC TTTTCACTTT CCTTTGTAGC AATTTATTTC 180 

CTGATTAATT TCGTGTATCC TGTAGATATG GTCATTAATT TGCCATTTTT GATTAATACT 240 

GGTTTGATTG TCTTGCTATC AGCTATCTCT TATATTAGTC TACTTGTCTT CACAAAAGAT 3 00 
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AGCATTTTCT ATGAATTTTT AAACCATGTC CTAGCCTTAA AAAATAAATT TAAAAAATCA 3 60 

TAGGAGTTTA AAATGAAACA ACTAACCGTT GAAGATGCCA AACAAATTGA ATTAGAAATT 420 

TTGGATTATA TTGATACTCT CTGTAAAAAG CACAATATCA ACTATATTAT TAACTACGGT 480 

ACTCTGATTG GGGCGGTTCG ACATGAGGGC TTTATCCCTT GGGACGACGA TATTGATCTG 540 

TCCATGCCTA GAGAAGACTA CCAACGATTT ATT AAC AT T T TTCAAAAGGA AAAAAGCAAG 600 

TATAAGCTCC TATCCTTAGA AACTGATAAG AACTACTTTA ACAACTTTAT CAAGATAACC 66 0 

GACAGTACGA CTAAAATTAT TGATACTCGA AATACAAAAA CCTATGAGTC TGGTATCTTT 72 0 

ATCGATATTT TCCCTATAGA TCGCTTTGAT GATCCTAAGG TCATTGATAC T TGTT AT AAA 780 

CTGGAAAGCT TCAAACTGCT GTCTTTCAGT AAACATAAAA ATATTGTCTA TAAGGATAGC 84 0 

CTTTTAAAAG ATTGGATACG AACAGCCTTC TGGTTACTCC TTCGACCGGT TTCTCCTCGT 900 

TATTTTGCAA ATAAAATCGA GAAAGAAATT CAAAAATATA GTCGTGAAAA TGGGCAATAT 9 60 

ATGGCTTTTA TCCCTTCAAA ATTTAAGGAA AAGG AAGTC T TCCCAAGTGG TACCTTTGAT 102 0 

AAAACAATCG ATTTACCCTT TGAGAATTTA AGCCTTCCTG CACCTGAAAA ATTTGATACT 10 8 0 

ATTTTGACAC AATTTTATGG AGATTATATG ACCCTACCAC CAGAAGAAAA ACGCTTCTAC 1140 

AGTCATGAAT TTCACGCTTA TAAATTGGAG GATTAGGATG CAATATTTAG AAAAAAAAGA 1200 

AATTAAAGAA ATTCAACTAG CCCTGCTGGA CTATATTGAT GAGACTTGTA AGAAACATGA 12 6 0 

TATTCCTTAT TTTCTCAGTT ATGGAACCAT GCTTGGAGCC ATCCGCCACA AAGGTATGAT 13 2 0 

TCCTTGGGAT GATGATATTG ATATTTCCCT TTATCGTGAG GATTATGAGC GTTTACTGAA 1380 

GAT TAT T G AA GAAGAAAATC ACCCTCGCTA CAAGGTTCTT TCCTACGATA CATCTTCTTG 144 0 

GTACTTCCAT AATTTCGCAT CGATTTTGGA CACTTCTACT GTTATAGAAG ACCATGTTAA 1500 

GTACAAGCGT CATGATACCA GCCTTTTCAT CGATGTCTTC CCAATTGATC GATTTACAGA 15 6 0 

CTTGAGCATT GTCGACAAGA GCTATAAGTA TGTGGCTCTT CGTCAACTAG C T TAT AT C AA 1620 

AAAATCACGA GCAGTTCACG GTGATAGCAA ACTAAAAGAT T T T C TT AG AT TATGTAGCTG 1680 

GTACGCTCTC CGATTTGTCA ATCCTCGCTA CTTTTACAAG AAAATTGATC AACTAGTCAA 174 0 

AAATGCTGTA ACCAACACTC CTCAATATGA AGGAGGAGTT GGGATCGGTA AGGAAGGGAT 1800 

GAAAGAAATC TTCCCAGTTG ATACCTTTAA AGAACTGATT TTAACTGAGT TTGAGGGCCG 18 60 

TATGTTGCCT GTTCCCAAAA AATATGACCA ATTTTTAACC CAGATGTATG GCGATTATAT 192 0 

GACACCACCA TCAAAAGAAA TGCAAGAGTG GTATAGTCAT AGCATTAAAG CTTATCGCAA 1980 

AAACTGATTG AGGGGGATTA TACAAACTAC TAAGATAGAG GTTATTCAAA AACATAATTT 204 0 

TAGTAGAAAA T G AAAT AC AT ATTCCCACAA T AAAACG C AT CATATCAAGG TTTTTGAAAA 2100 
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AC CTTG AT AT GATGCGTTTT ATAATTTTAA AGACTTTTTT CTATAGTAGA TTGAAATAAG 2160 

ATGCGAACAA ATCAATTAGA AAATTCAAAT TAATTTATAG AAATATTTTA GTATTCCTGT 222 0 

GTACTGTTCT AAATTCAGTC TGCTATATCT TATTTTTCTA TTTAAATCGC TTCTGTAACA 2 2 80 

AAGCTACGAC TTTCAAGTAC CTTAAGCATG GCATTAGCTG TATCTAGCGC TGTGAAGAGG 2 3 40 

GGCACCCCGT GTTCAATGGC TGAACGACGA ATTTGCTCAC CATCTTCGTC AGCAGTTCGT 24 00 

TTTGTTCCTA CTGTGTTAAT GATAGCTTGA ATTCTTCCTT TGCGTACAAA AC TTGGG AT A 24 6 0 

TCCTTATCGT CATCACCAAT CTTACCAACA GGTTGGGCTT GCAAGCCATG AC T AG C AAAG 2 52 0 

AAGGCTGCTG TCCCTTCTGT CGCAAGGATT CCATAACCAA TGTTTTGGAA ACGACGAGCC 2 5 80 

AAGTTCAAGG CTTCTTCTTT GG CAT CATC A GCGATGGTAA AGACGACATT ACCAAAAGTT 2 640 

GGCAAGTGTA GAT AAGAAG C TTCAAAGGCT TTATAGAGAG CTTTTTCCAA AGTAGCATCA 2700 

GAACCCATAA CTTCACCTGT TGACTTCATT TCAGGACCGA GCAAGCTGTC TACCTTAGCT 2 7 60 

AGTTTGGTAA AGGAGAAGAC AGGTGCCTTG ATATGAACAC GGGTGCTTTC AGGGTAAAGT 2 82 0 

CCATTTTGGT AGCCAAGTTC TGATAAACTT TGACCAAGAA TGAGTTTGGT CGCTACTTGA 2 8 80 

GCCATAGGAA TATTGGTTAC CTTAGATAGG AATGGAACAG TACGGCTGGC ACGTGGATTG 2 940 

ACCTCAATAA CGTAGACTTT TTCATCCTTG ATAACAAACT GGATGTTCAT CATTCCAAGG 3 000 

CAGTGAAGAC CGATTGCTAA GCGTTTGGTG TAGTCTGCGA TGGTCTCCTG AACCTTTTGC 3 0 60 

GACAAGGTTT GTGGTGGGTA AACAGCCATT GAGTCACCTG AGTGGACACC AGCACGTTCG 312 0 

ATATGCTCCA TGATACCAGG AATGAGTACA TTTTTACCAT CTGAAATGGC ATCAACTTCG 318 0 

CACTCTTGCC CAACGATATA AGAGTCGACA AGAACTGGGT GGTCTGGACT AGCCTTAACA 3 240 

GCAGTTCGCA TGTAAGAACG AAGGTCTTCT TCGTTTTCAA CGATTTCCAT GGCACGTCCA 3 3 00 

CCAAGTACAT AAGATGGGCG GACAAGAACT GGGAAGCCAA TCTTGCGAGC TGCAAGAGCT 3 3 60 

GCTTCTTCTT CATTGGTAGC CGTTTGTCCT GGTGGCTGTG GAATATCCAA TTCTTTGAGA 3 42 0 

GCTTGCTCGA AGAGGTCACG GTCTTCGGCA CGATCTAGGT CAGCAACCTG TGTACCAAGG 34 80 

ATGGT CAC AC CTGCTTTTGC CAATGGCTCC GCAAGGTTGA TGGCTGTTTG ACCACCGAAC 3 54 0 

TGAACGATAA CTCCCTTTGG TTGTTCCAAG TCAATGACGT TCATAACATC TTCGAATGTC 3 600 

AATGGCTCAA AGTAAAGCTT ATCTGATACA GAGAAGTCTG TTGAAACGGT CTCTGGGTTT 3660 

GAGTTCATGA TGATAGCTTC ATAACCAGCT GCCTGGATAG CCTTAACAGA GTGAACGGTT 3720 

GCGTAGTCAA ACTCAACCCC TTGACCGATA CGGATTGGAC CTGAACCTAG GACAAGTACA 3 7 80 

GATTCTTTAT CAGATCTGAT AGATTCATTT TCCCAACCAT AGGTTGAATA GAAATATGGC 3 840 
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GTTTCGGAGT CGAACTCTGC CGCACAAGTG T CT AC C AT C T TATAAACTGG AACAATCTTG 3900 

TTTTCCAAGC GAAGTTGGCG AACTTTATCA TCAGTCGTTC CCCAGAGTTC AGCAATCTTA 3 960 

CGGTCTGAAA AACCATTAAG TTTGGCTGTT TTCAAAACTT CTAAATCTTG TGGATGAGCA 402 0 

CCCAATTCTT GCTCAATTTC AAAGATATGC AAGAGTTTAT CAAGATAGAA GATATCAATT 4080 

TTTGTAAGCT CTGCAATTTC TTCAGGTGTG TAGCCACGAC GAATGGCTTC TGATACGTAG 4140 

AAGAGACGGT CATCTTGGGC TTTGACAACC TTTTCAATCA AGGCATCATC AGAAACTGCT 42 00 

GCAAGTTCAG GTATTTCATT GTGGTGCACC CCAATTTCAA GGGAGCGGCA GGCCTTGAGA 42 60 

AGAGATTCCT CGATGTTACG AC CG AT T GC C ATGACTTCTC CAGTCGCCTT CATTTGTGTA 4320 

CCGAGACGGC GTTCACCCTT TTCAAACTTG TCAAATGGGA AACGTGGAAT CTTAGCAACT 43 80 

ACGTAGTCAA GGGCTGGTTC AAACATGGCA TAGGTTGAAC CTGTAACTGG GTTTATAACC 44 40 

TCATCCAAGG TCAAACCTAC TGCAATCTTG GCAGCCAACT TAGCAATCGG ATATCCTGTC 4 500 

GCTTTAGAAG CAAGGGCTGA CGAACGTGAT ACACGAGGGT TTACTTCGAT AACATAATAC 4560 

TTGAAGCTGT TAGGATCAAG AGCTAGCTGA AC ATT AC AT C CACCTTCAAT CTTGAGGGCA 4 62 0 

CGAATAATGC TCAAGCTCGC ATCACGAAGC ATTTGGTTTT CATAGTCTGA CATGGTTTGC 4680 

GCAGGGGCAA ATACAATGGA ATCCCCTGTG TGAATCCCAA CTGGGTCAAA GTTTTCCATG 4740 

TTACAAACAA CCAAGGCATT GTCAGCTGAG TCACGCATCA CTTCGTATTC AATTTCCTTG 4800 

AAACCGGCAA TCGAACGCTC AATCAAACAT TGGGTAACAG GTGACAATTT CAAACCATTT 4 8 60 

TCAGTGATTT CACGCAATTC TTTCTCGTTG GCACACATAC CACCACCAGT ACCACCAAGG 4 920 

GTAAAGGCTG GACGAACGAT GACTGGGTAG CCAATTGTCG CTGCAAAGGC AACTGCTTCT 4 9 80 

TCTACTGTGT TAACAATTTC AGATTCTGGA ATGGGTTGTT CAAGCTCTTC CATCAATTGT 504 0 

TTAAAGAGGT CACGGTCCTC CGCTTGGTCA AT GGC AG AT A ATTTGGTACC CAGAAGTTCA 5100 

ACGCCAAGCT CGTCTAGGAT ACCATTTTTA GATAATTCCA TGGCCATGTT GAGACCTGTC 5160 

TGACCACCGA GTGTTGGTAG CAAGGCATCT GGACCTTCCT TACGAAGAAT ACGTGTCACA 5220 

AACTCAAGTG TAATCGGTTC AATGTAAACC TTGTCAGCAA TTTCCTTGTC CGTCATGATG 52 80 

GTTGCAGGAT TTGAGTTAAC CAAAACAACC TCATAACCTT CCTCTTTCAA CGACAAGCAA 5340 

GCCTGAGTCC CAGCGTAGTC AAACTCAGCA GCCTGACCAA TAATAATCGG ACCAGAACCA 5400 

ATCACCATAA TTTTTTGAAT ATCAGTACGT T T AGGC AT AT ATAAGATATT AAGGGTGTCA 54 60 

* 

AGCGGACAAA GC T AAAAT AG GAGTTATGAC GAAGAACTGT CAGTTCTAGG AATAACTATC 552 0 

TTTTTAGCAC CGTCCGTAGC CCGTATTCAG TTCAGCAAAT ACGGAGCACC CTTCTCCTTT 55 80 

CTATTCGTCG CCTCT CAGGG CGACATTAAA TAAGATACAA AGGACGAATA GAAAGCGATT 5 640 
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GAATTTTAGG 


AAATCAAGGA 


AGGATTGACA 


ATCCAAGTTG 


GTTTCTCTAC 


ATTCTGAGCT 


5700 


TTCCGTCCGT 


GTTCAGTTAC 


ATAAATTCTC 


CGACGAGCTT 


TTACTCGTTC 


TTAGTTTGAT 


5760 


TGTTTAAAAA 


CTTCCATCAT 


CTCGATAAAC 


TCGTCAAATA 


GGTAGCTAGC 


GTCGTGTGGC 


5820 


CCAGGAGCTG 


CATCTGGGTG 


GTATTGAACA 


G AG AAAG C AG 


GTTGGTATCT 


GTGGCGCACA 


5880 


CCTTCCACTG 


ACTTGTCATT 


G ATT T C TT CG 


TGGGTAATAA 


TCAAGTGCTC 


TGGCAAATCC 


5940 


TCGCGGCTGA 


CTGCATAACC 


ATGGTTCTGG 


CTGGTGAAGT 


CTACTCGTCC 


TGTTGCGATT 


6000 


TCACGTACCG 


CATGGTTGAA 


TCCACGGTGG 


CCAAACTTCA 


TCTTATAGGT 


CTTAGCCCCG 


6060 


TTTGCCATTG 


CAAAGAGTTG 


GTGTCCCATA 


CAAATACCAA 


AGATTGGAAT 


TTTTCCTTGT 


6120 


ACACCGCGAA 


TCATGTCGAG 


TGCTTGTGGA 


ACGTCTTCTG 


GGTTACCTGG 


ACCATTTGAC 


6180 


AACATAACTC 


CGTCAGGATT 


GAGATGGAGA 


ATTTCTTCAG 


CCGTTGTCGA 


ATAAGGAACA 


6240 


ACTGTCACGT 


TACAGTTGCG 


TTTAGAAAGT 


TCACGTAGGA 


TTGAGTGCTT 


GAG AC C AAAG 


6300 


TCCACTAGCA 


CCACGCTCAA 


ACCAACTCCT 


GGAGCTGGAT 


AAGACGTTTT 


AGTAGAAACC 


6360 


TGTTTGATAT 


TGTCTGTCGG 


TAAAACTGTT 


GCTTGGAGCT 


GGTCCGTCAC 


ATGGTCCATA 


6420 


CTGTCCCCAA 


CATGGGTCAA 


GGTTGCACGC 


ATAGTACCAT 


GCTTACGGAT 


AATCTTGGTA 


6480 


AGAGCACGCG 


TATCAATTCC 


TGAAATCCCT 


GGAATTTTCT 


TGGCTTTCAA 


AAATTCATCC 


6540 


AAGGTCATTT 


GGTTGCGCCA 


GTTGCTAGCT 


CTACGCGCTT 


CTTCAAAAAC 


AACGACTCCC 


6600 


TTACAAGTTG 


GAATAATGGA 


TTCATAATCA 


TCACGATTAA 


TACCATAATT 


TCCTACCAAA 


6660 


GGATAAGTAA 


AGGTCAAGAT 


TTGTCCATTA 


TAAGACTGGT 


CTGTAATGGA 


TTCTTGGTAG 


6720 


CCGGTCATCC 


CTGTATTAAA 


GACGATTTCG 


CCTGTTACAT 


CAATATCTGC 


TCCGAAGGCC 


6780 


TTGCCTTCAA 


AAACTGTGCC 


ATCTTCTAAT 


ACT AG AATT C 


TTTTTGTCAT 


ATTTTCACCT 


6840 


CTCGTGGACG 


CTCACTGGCG 


TCTTTTAACG 


TCTTGTGTTT 


TAGTTGGCGT 


TTCTACTCGC 


6900 


TAGTACGGAT 


TCTAAGATTG 


CCATTCGAAC 


AAAG AC AC C A 


TTGGTCATTT 


GTTGGACAAT 


6960 


CCGTGATTTT 


GGTGCTTCAA 


CCAAGTGGTC 


TGCTATTTCT 


ACATCACGAT 


TGATTGGAGC 


7020 


TGGGTGCATG 


AGGATTGCTG 


TTTCTTTCAA 


ACGATCGTAA 


CGTTCTTGAG 


TCAAGCCATG 


7080 


TTGGGCATGG 


TAGTCTTCTT 


TTGAAAATAC 


AGCTCCACTA 


TCATGGCGTT 


CGTGTTGCAC 


7140 


ACGGAGAAAC 


ATCATGACAT 


CAACCTGATC 


AATGATTTCA 


TCAATGGTTA 


CAAACTGTCC 


7200 


ATAGTCTGCA 


AACTCTTGAC 


TTCTCCATTC 


CTCAGGTCCA 


GCGAAAAAGA 


GTTCAGCTCC 


7260 


CAAGCGTTTC 


AAAATCTGCA 


TATTGGATTT 


GGCAACGCGT 


GAGTGGTCCA 


AGTCACCTGC 


7320 


AATAGCAACT 


TTAAGACCCT 


CAAAGTGGCC 


AAATTCCTCA 


TAAATGGTCA 


TCAAATCAAG 


7380 
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CAAGCTCTGG CTAGGGTGTT GGCCCGAACC ATCTCCACCA TTGATGATGG AAGTCGTAAT 7440 

CGTTGGACTA GCAATCAATT CTCTATAGTA GTCGACCTCT GGATGGCGAA TC AC AC AG AC 7500 

ATCCACTCCT AAAG C AG AC A GAGTCAAAAT GGTGTCATAA AGTGTCTCAC CCTTATTAAC 7 560 

CGAGCTAGTC TTCACATCAA AGTCAAGTCG TTCCAATCCA AGTTTAATCT CTGCGACTTC 7 620 

AAAGG ACT T A TGTGTCCGTG TAGAATCCTC AAAGAAGAGA TTGGAAACAA TCGGATGGTC 7 680 

TTCATAGGGA AGCTGGGCTC CATTTTTAAA CTCAATTCCT CGCTTGATCA ATTTCATTAC 7740 

TTGATCGACA GTGAGGTCTT CCATGGACAC CACATGGTTC AATGCTTGTT GATTTTCTGA 7 800 

CATGGCTACT CCTTTAACTT TCTAAGCTTC TTCAGTAATC AGAACTCTGT CTTGGTCATC 7860 

AAGTTCTGTC ATCTCTACGA TGATTTCTTC AGAACGACTG GTTGGGATAT TTTTTCCAAC 7 92 0 

GTAATCTGGA CGGATTGGCA ATTCTCTATG TCCACGATCG ACTAGAACTG CTAAACTCAC 7 980 

ACGCGCAGGA CGACCATGAC CGACAATATT ATCAATAGCA GCACGGATGG TACGACCTGT 8040 

ATAGAGCACA TCATCCACCA AG AT AAC T T C GCGGTCTGTC AC AT CG AC AG AAACCAAAGA 8100 

AGTATCTTCT CCACTTTTAA CATCATCACG GAAAGGTTTA GTATCCAATT C C AC AAC AGG 8160 

AACTGAAAGA TTTTCTAACT GCTTCAAACG TTCTTGGATT CGGTGGGCAA T AAAG AC AC C 822 0 

ACGAGTTTTA ATACCAGCCA AGACGATCTT ATTCAAATCT TTGTTGCGTT CGATAATCTC 82 8 0 

ATAAGTAATA CGCGTAATCG CTCGTTTGAC GGTCAATTCG TCTACAACTT CTTTTGTTTT 83 40 

CATGACAAAC CTCCAAAAAG AAAAGTCTCC TTAAACAAGG AG AC T T G AAA TTTATAGCCA 8400 

AGCGAGCCCT ACTGCACACA GTATAGACTT CACCCTTCTA CTTTATCGCG CTCCTTGCCT 84 60 

GCCTCACGGG ACAGGTTTAA AGGAATATTT AGTTATCATT TACTATAGCA CAAAGCATGC 8 520 

TTAAAATCAA GCAAAAAGTT TCAATGTAGC ATCTTACAAA TTGCTAAAAT CATATAATTG 8 5 80 

TGGGTACTGG TCACACTCTG GATTTTTTGG ATGGCAAATG GCTCTTCCAA AATAAATCAT 8 64 0 

GGCCTGATGG GCAGCTAACC ACTGCTCAGG CGGCAAGATA TCCATGACCC GCTTTTCCAC 87 00 

CTCAAGTGGC GTCGCTGATT TTTTGACAAT ATCGTGGTGT TTGCAAATAC GCTCCACATG 87 60 

AGT AT CC ACT GCAAAGGCTG GAATTCCAAA TCCTACACTC AT G AC AAC AT TGGCTGTCTT 8820 

GCGACCAACA CCTGCCAAAC TCTCCAATTC TTCACGTGTC TGAGGGACTT G ACC AT C AAA 8880 

ATCGTCTAGT AACTGTTGGG CACATTTTTT AAGGAATTTA GCTTTATTCC GATACAATCC 894 0 

CAAGCGAGAA ATATGTGAAG CAATCTCACT CTCTGTCGCT ACAGACATAG CTTGGGGTGT 9000 

TGGAAAGGCA ACAAAGAGAC CTGGTGTGGC CTTATTTACC GCTGCATCTG TCGTCTGGGC 90 60 

TGATAACATG ACCGCAACCA GG AGT T C AAA ATGATTGGTA AAATCAAGAC TAGGCTTGGC 9120 

AT C TGGGAAG AGGGCAATGA TTTCTTCTAG CACCTTTCGT GCTCGTTTTT TTGACAAGAC 9180 
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C ATT ATT CAT CTCCGTCAAA TAGTCCTTGT AAGCCAGCAA AAGGACTGTT TTCTTCTTTC 92 4 0 

TTTACTGCTT TTTGAGCTTG GTATTCTTCC TCTGTCATGA TTTGCCAGTC ATTTCCTGAG 9300 

ATAAATCCTT GACCAGCTTC TTCTTCAGCC GTCAAGACCT TGATAGGAAT GTTTAGCAGG 9 3 60 

ATATTGTCTG ATACACTCTC AGCAAGGTCA AGCTCCCCAT TTTCGATGGG CAAGACCAAG 942 0 

T CATC AT C T A AAACTTCTTG ATCTAGCTGG TTAGTTGCGC CTTCCATGAA AACTTCCGTG 9480 

ACTGGATAAG ATTCAACTAA CTCAACTGGC TCCATACTGC GACTCGACGC AAGAACAATG 954 0 

GTATAAGATA GTTGATAATC TAAGAAATAC ATACGGTCTT CATATTGTAC TTTCCCAACT 9 6 00 

GCAAGGATAT CTTTTACATC TAAAATTTCT TGATTACGTG CACGCAGGTC ATCAACTAAA 9660 

TCTAACGTTT GTTCAAAGTT CAAACCTTCA GACTGCTTAC GAATTTCTTG AATATTTAAT 9720 

TTCATACTTC CTCCATAAAG ATTTACTCTC TTGATTATAC CATGAAAAGG CTACAAATCA 9780 

G C AC ACC AAA CTTTGTAATT AAAATTCAAA ATTTTAACAT ATTTACTATG ATAGTTTTAT 984 0 

TTTTTAGTGC TATACTATAG GGAAAGAGTA CATCAGATCA AGGAGGATGC TCACATGGAA 9900 

GACAAGAAAC TCATTCAACT CCTATCCAAG TTAAATAAAA GCTACCAAAA CTGTAAACAG 9960 

GGTACGGCAG ATGATATTCG ACTACAAGAG CTGCTAAACA C T ACT ATG C A AGAGCTCAAA 10020 

AAAACGGAAC AGTTGAACAA CAGTATCTTA ATTGATCTTG AGAAATTTTA CCAACCTACC 100 80 

AGTCTTCTGA TTGGACTGGG TAGCCTAAAA CTAAACGATC AAGCACGCAC TGCTTGGCGA 1014 0 

AACTATGATA AATTCCATTA CGATCATGTC AAACACGTAC TAAGTCTCTA TGGACCTGTT 10200 

TTTGAATTTT AGAGCATAGA ATTTCCAGTT TTCTGTTGAC AAAATTTCCT TAAAGGTATA 102 60 

ATATAAAGAT ACT AAT ACT C GGAGGTAAGG GAG AC AT G AA CAACTAAGTC TATCAAATAA 1032 0 

AGAACCTTTA TTTAGTAGAT CTTGTTTTTG TCTCTTTTTG TGTGCTCTTT TATGCTCTTT 103 80 

TTCTGGCATG TTAATAGAGT TTTTTTGACA TAGACTTTGG GCT C TACT AG GTAAAGTAGA 10440 

GCTTTTTGTT ATG C AC T ATG AACATTCTAG AAAGGGAAAT CATATGATAA AAATCAATCA 10500 

TCTAACCATC ACACAAAACA AAGATTTACG AGATCTTGTA T C T G AC CT AA C C ATG AC CAT 105 60 

CCAAGACGGG GAAAAGGTTG CTATTATTGG TGAAGAAGGA AATGGCAAAT CAACCTTACT 10620 

TAAAATTTTA ATGGGGGAAG CTTTGTCTGA TTTCACTATC AAGGGAAACA TCCAATCTGA 10 680 

CTATCAGTCA CTGGCCTACA TTCCTCAAAA AGTCCCTGAG GACCTAAAAA AG AAAACT TT 10740 

ACACGACTAC TTCTTTTTAG ATTCTATTGA TTT AG AC T AC AGTATCCTCT ATCGTTTGGC 10 8 00 

GGAGGAATTG CATTTTGATA GCAATCGTTT CGCAAGTGAC CAAGAGATTG GCAATCTATC 108 60 

AGGGGGCGAA GCTTTGAAAA TTCAGCTTAT CCATGAGTTA GCCAAACCCT TTGAGATTCT 1092 0 
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AT T T TT AG AT GAACCTTCAA ATGACCTAGA C CTTG AG AC A GTTGATTGGC TAAAAGGCCA 109 80 

GATTCAAAAG ACCAGGCAAA CCGTTATTTT CATTTCCCAT GATGAAGACT TTCTTTCTGA 11040 

AACGGCAGAC AC T ATT GTTC ACTTGCGACT GGTCAAACAC CGTAAAGAAG CGGAAACGCT 11100 

AGTAGAGCAT TTAGACTATG ATAGCTATAG TGAGCAGAGA AAGGCTAATT TTGCCAAACA 11160 

AAGTCAGCAA GCTGCTAACA ACCAAAGAGC CTACGATAAA ACCATGGAAA AACATCGGAG 11220 

AGTTAAGCAA AATGTAGAAA CTGCGCTTCG AGCTACCAAA GATAGTACTG CCGGTCGCCT 112 80 

ATTGGCTAAA AAGATGAAAA CTGTCCTCTC ACAAGAAAAA CGCTACGAAA AGGCAGCTCA 11340 

GTCCATGACT CAAAAGCCAC TTGAAGAGGA ACAAATCCAA CTTTTCTTTT CAGACATCCA 11400 

ACCATTACCA GCTTCTAAAG TCTTAGTCCA ACTGGAAAAA GAAAATTTGT CCATTGACGA 11460 

CCGAGTTTTG GTTCAAAAAC TACAACTAAC TGTCCGTGGC CAAGAAAAAA TCGGTATTAT 11520 

CGGGCCAAAT GGTGTTGGGA AATCAACTCT GTTAGCCAAG TT AC AG AG AC TTCTGAATGA 11580 

TAAAAGAGAG ATTTCACTTG GTTTTATGCC AC AAG AT T AC CACAAAAAAC TGCAATTGGA 1164 0 

TTTATCCCCA ATAGCCTATC TCAGTAAAAC TGGGGAAAAA GAGGAACTAC AGAAAATCCA 117 0 0 

ATCTCACCTA GCTAGTCTCA ATTTCAGTTA TCCAGAAATG CAGCATCAAA TTCGCTCCTT 117 60 

ATCTGGCGGA CAACAGGGAA AACTCCTGCT TTTGGATTTA GTCCTGCGCA AACCAAACTT 11820 

TCTCCTGCTG GATGAACCCA CACGAAACTT TTCTCCCACT TCTCAACCCC AAAT C AG AAA 118 8 0 

ACTCTTTGCT AC C TAT C C AG GCGGTCTCAT CACTGTTTCG CAT G AC CG T C GTTTCTTAAA 1194 0 

AGAAGTCTGC TCGATCATCT ATCGCATGAC AGAACACGGT TTGAAGCTAG TTAATTTAGA 12000 

AG AT T T AT AA ATTTGCAACA TAGCAAAAAT CCAGAGACGA CCTCTGGATT CTTTTACATC 12 060 

TGTTTTAAAC GTTCAATCCG TTCTGAGATA GGTGGGTGGG TATAAAAGAG TTTTTGGAAC 1212 0 

CCCCCACCTT TCTTAGGATC ATTGATATAA AGGGCACTGC T AGC AT CAT C GACGTGGCGA 1218 0 

CTCATAGGTT TGCTATTGTC CAACTTATCT AGGGCATTAA TCATTCCCTG GGGATTGCGA 12240 

GTCAGCTCGA CACTAGATGC AT CTGC C AG A AATTCCCTCT GACGAGAAAT AGCGAGCTGA 12 3 00 

AC C AAGGTTG CAGCGAGAGG TGCCAGTACA ATAGCTAGTA GGGAAACCAC TAG C AT AATG 123 60 

ATTTCAAGAC CATTTCCATC TCGGTCATCA TCACTTCGTC TGCGACCTGC TCCACCCCAC 12420 

C AC AT CAT AC GACCTGCCAT ACTAGAAAGC ATGGTGATAG C AC T AGC AAG GGCAACTGCA 12480 

ATAGTCGAAA TACGGATATC ATAATTACGA AT ATG ACT G A CTTCATGTCC C AT AAC AGC T 12 540 

TCTAGTTCTT CACGATTCAT GATAGCTAGT AG ACC TG AAG T C G C AG C AAC AGCCGCATTT 12 600 

TGAGG AT TAG AACCTGTCGC AAAGGCATTT AAGGC TGGAT CATCAATGAT GAAAACACGG 12 660 

GGCATAGGAA TCTGAGCGAC CAGAGCCATA TCTTCCACTA CATGGTAGAG GTCTGGTGCC 1272 0 
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GTTTGCTCAT CCACCTCACG CGCTCCATTC ATGGACATGA CAATCTCTGT CGATTGAAAA 127 80 

AT CAT AG AC A AAG CGT AG AT AAAGCCGATA ATCAGTGCAA TAACCAAACC ACCAAGTCCA 12 840 

GATCTTATAA AGAGATAACC AACCGCATAA CCAACAAGAG CTAAGAGTAG G AAAAAT AC C 12 900 

AGCAACAAAA TCCAGGTTTT TCGTTTATTG CTTGCAATTT GATCAAACAA CATCTTAGTC 12 9 60 

ACCTAAACCG CTAAAATCAA CTTTAGGAAC CGACTTTTCC TCTTCAGGTG TTTGAAGGAA 13 020 

ATCTGCCGCT TTAAATCCAA ACATTCCAGC GATAATATTG CTCGGGAAAG TTTCTAATTT 13 08 0 

TACATTGTAG TTGCTGACAA CACTGTTATA GAGTTGACGA GAGTAAGAAA TTTTATTTTC 13140 

TGTGTTTGTC AACTCCTCTT GCAATTTAAC AAAGT T AG C A CTAGCTTTCA AATCTGGATA 13 2 00 

GCTTTCTGCA AC TGC AAAAA TACCTGAAAC CTGACGAGTG AGGGCATCAC TGGCTTTCAT 13 2 60 

AGCTTCTGCT GGTGAAGTCG CTGCCGCCAC TTGGTTACGT AGTTCTGCCA CCTTTTCAAG 13 320 

GGTAGAACCT TCATATTTGG CATAACCTTT TACAGTCTCA ATCAAGTTTG GCAAGAGGTC 13 3 80 

ATTGCGACGT TTCAACTGAA CATCAATCTG ACTCCAAGCC TCCTTGGTTT GCATACGATT 13 440 

TTTAACCAAA CCGTTATAGC TAACAATCAC AAAAAT AACA ATAAGAGCGA TAACTCCAAG 13500 

AATAATCCAA GTCATAATAT AAGTCCTTTC TGCTTTTAGA TTAGTACCAG T AT AT C AAAT 13 560 

TTTCT AT GAT TGTGGTAAAA T AAG AT GAT A CTAAAGAAGG AAAT AAC TAT GAAACCAAAA 13 62 0 

ACATTTTACA ACTTGCTTGC CGAGCAGAAT CTTCCACTTT CGGACCAGCA AAAAGAACAA 13 680 

TTTGAACGTT ATTTTGAGCT CTTGGTCGAG TGGAATGAGA AGATTAATTT GACGGCGATT 13 740 

ACGGACAAGG AAGAAGTTTA TCTCAAACAT TTTTACGATT CGATTGCACC CATTCTTCAA 13 800 

GGTTTGATTC CCAATGAAAC TATCAAACTT CTTGATATCG GGGCTGGGGC AGGATTTCCT 13 8 60 

AGTCTACCAA TGAAAATTCT CTATCCGGAG TTAGATGTGA CCATTATTGA TTCACTCAAT 13 920 

AAGCGCATCA ACTTCCTACA ACTCTTGGCT CAAGAACTGG ATTTGAACGG AGTTCATTTC 13 9 80 

TACCACGGAC GTGCCGAAGA TTTTGCCCAA GACAAGAACT TCCGTGCTCA ATATGATTTT 14 040 

GT AAC AG CT C GTGCGGTTGC CCGTATGCAG GTCCTATCTG AATTGACTAT TCCCTACCTT 14100 

AAGGTTGGTG GCAAACTATT AGCACTCAAG GCTAGCAATG CGCCTGAGGA AT TAT T AG AA 14160 

GCTAAGAATG CCCTCAATCT CCTTTTTAGT AAGGTCGAAG ACAATCTCAG TACGCCCTAC 1422 0 

CGAATAGAGA TCCGCGCTAT ATCACAGTGG TAGAAAAGAA AAAAGAAACA C C AAAT AAAT 14280 

ATCCACGTAA GGCTGGTATG CCAAATAAAC GCCCACTTTA AATTTTTTAG TAAACAAATG 14 34 0 

TTTACAAAAT CAGCCTCGCT CTTTTATTTC TAGGCTCGGG AAAAAATGAT TTACAAAATC 14400 

AGCCTCGCTC TTTTATTTCT AGGCTCGGGA AAAAATG AT T TACAAAATCA TTTTTTTCTG 14460 
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CTATACTATC CTAAGCAAAG GTTTTTAATG TCATCCCGTG AGGTGACGAA G AC GC AG AAA 1452 0 

TATTTAAAAC TCTTTAAAAT CTAAATTTTA AAGAAGTCTT ACTCTGAGGG CCTATTGCTG 14 5 80 

TAAAATAATG GGCTCTTTTT TGATGCCCAA AAGTGAGGTT TATATGAAAC AAGAATCAAC 14 640 

TGTTGATTTG TTAC 14 654 
(2) INFORMATION FOR SEQ ID NO : 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

AG AAAAATC T GCTTTACAGA AAATAAAAAT AATAGGAGAA AATCTATGTC AG AT T TG AAA 60 

AAATACGAAG GTGTCATTCC AGCCTTCTAC GCATGTTATG ATGATCAAGG AGAAGTAAGC 12 0 

C C AG AACGT A CGCGTGCCTT GGTTCAATAC TTCATTGATA AAGGTGTTCA AGGTCTTTAT 180 

GTCAATGGTT CTTCTGGTGA ATGTATCTAC CAAAGCGTTG AAGATCGCAA GTTGATTTTG 24 0 

GAAGAAGTCA TGGCGGTAGC AAAGGTAAAT TG AC C AT TAT TGCCCATGTT GCTTGCAATA 3 00 

ATACTAAAGA TAGTATGGAA CTTGCTCGCC ATGCTGAAAG CTTGGGAGTA GATGCTATTG 360 

CAACGATTCC ACCAATTTAT TTCCGCTTGC CAGAATACTC AGTTGCCAAA TACTGGAACG 42 0 

ATATCAGTTC TGCAGCTCCA AACACAGACT ACGTGATTTA CAACATTCCT CAATTGGCAG 480 

GGGTTGCTTT GACTCCAAGC CTTTACACAG AAATGTTGAA AAATCCTCGT GTTATCGGTG 54 0 

TGAAGAACTC TTCTATGCCA GTTCAAGATA TCCAAACCTT TGTCAGCCTT GGTGGAGAAG 6 00 

AC C AT AT CGT CTTTAATGGT CCTGATGAGC AGTTCCTAGG AGGACGCCTC ATGGGGGCTA 6 60 

GGGCTGGTAT CGGTGGTACT TATGGTGCTA TGCCAGAACT CTTCTTGAAA CTCAATCAGT 72 0 

TGATTGCGGA TAAGGACCTA GAAACAGCGC GTGAATTGCA GTATGCTATC AACGCAATCA 7 80 

TTGGTAAACT CACTTCTGCT CATGGAAATA TGTACGGTGT CATCAAAGAA GTCTTGAAAA 840 

TCAATGAAGG CTTGAATATT GGATCTGTTC GTTCACCATT G AC AC C AGTG ACT G AAG AAG 900 

ATCGTCCAGT TGTAGAAGCG GCTGCTGCCT TGATTCGTGA AACCAAGGAG CGCTTCCTCT 9 60 

AATCTAAAAG GAGGTATTTA TG AC AT AT T A CGTTGCAATT GATATCGGTG GAACCAACAT 102 0 

CAAGTATGGT TTGGTTGATC AAGAGGGGCA ACTTCTTGAA TCGCATGAAA TGCCAACTGA 1080 

GGCGCATAAG GGTGGACCTC ATATCTTACA AAAG AC C AAA GAT AT CGT AG CTAGTTATTT 114 0 

AGAAAAAGGC CCAGTAGCAG GTGTTGCCAT ATCTTCTGCT GGGATGGTGG ATCCGGATAA 12 00 
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GGGTGAGATT TTCTATGCTG GGCCGCAAAT CCCTAACTAC GCAGGCACCC AGTTCAAAAA 12 60 

GGAAATCGAA GAAAGCTTTA CTATTCCTTG TGAGATTGAA AATGATGTCA ACTGTGCAGG 13 2 0 

TCTTGCTGAG GCAGTATCTG GTTCAGGCAA GGGAGCAAGT GTGACACTTT GC T T G AC CAT 13 8 0 

TGGAACCGGT ATCGGTGGTT GCTTGATTAT GGATAGGAAA GTCTTCCATG GTTTTAGCAA 1440 

TTCAGCCTGT GAAGTCGGGT ATATGCATAT GCAGGATGGA GCTTTTCAAG ACTTGGCTTC 1500 

TACAACAGCT TTAGTGAAAT ATGTAGCTGA AGCCCATGGA GAAGATGTTG ATCAGTGGAA 15 60 

TGGCCGTAGA ATTTTCAAAG AAGCCACTGA AGGAAACAAA ATCTGCATGG AAGGTATTGA 162 0 

CCGTATGGTT G AC TAT C T AG GAAAAGGTCT GGCAAATATT TGCTACGTTG CCAATCCAGA 1680 

AGTGGTTATT CTTGGTGGTG GTATCATGGG GCAAGAGGCT ATCCTCAAAC CTAAGATCCG 1740 

TACAGCCTTG AAAGAGGCTT TGGTACCAAG TTTAGCAGAA AAAACACGAT TAGAATTTGC 1800 

CCATCACCAA AAT AC AG C AG GGATGTTGGG TG CAT ATT AT CATTTTAAGA CAAAACAATC 18 60 

CTAGTTTGGC TCAGCCAAAC TAGGATTTTC TTACACGTTT TTGTCTACGA TAGCCGTTGA 192 0 

GTTTTTTATT TTCCCAGTAG CTATTAAAGA TTTTTTCCTT GCTTTCGCGA TTGATTTCCA 1980 

AAAAGTAGGC ATAAATCAAA TCGATAAAGA AGAGCATAGG AAGTTGAGCG GATATTCGTT 204 0 

GGATATAGGA GGGTTGGCTG TGGGTGGCTA CAAGAACAGT CTCTGTATAG GTCTGGCTAT 2100 

CTTTATTGGG AACACTTGTA AAGAGTACAG TCTTTGCCCC CATCTCCTTA GCATCTAATA 2160 

GACTATCTAA AATAGAAGGA GTTGAGCCTG AAAGTGAGAA GCCCAGTACT AG AC AAT T T T 22 2 0 

CATCCATGAT GCTGGTTGTC CAGGCAAAGC CGTCTTGGTC TGTCAAAGCT TCGCAGACCA 22 80 

CACCTAGTCG CATAAAACGT AATTTCATTT CACGGGCGAC GAGGCCAGAA CTCCCTGTTC 2 34 0 

CAAAGAAGTA GATACGCTCA GCATCTTCGA TTAGCTGGGC AATTCGTTCT AGTTGGATTT 24 00 

CGTCAATCAA GTCTTGTGTT TGTTCCCTCA TATTGCTATA ACTTCTGAGG ACTCGTTTGG 24 60 

TCAGTGGACT GTGCTTGGAG ACTTGGTTGG CTTGATTTTC TGCCTGATGT TGGTATTGGA 2 52 0 

AAATAAATTC TCGGTAGCCA GTAAAGCCAC ACTTTTTAGC AAAGCGGGTC AAAGCAGCTT 25 8 0 

GAGAAATATG TAATTTTTGG GTGACTTGTT GAGAAGATAA ATCATCTGTA ATCGTTTCAG 2 640 

CTTGCAAAAA AT AG CG AGCG ATTTCTTGTT CTAGGTCTGT CATTTCTTCA AAATGTGAAT 2 700 

CAATGATAGT TGCGATATCT GGTTTGTCCA TAGGGAAAGC TCCTTTACAT GAGTCATACT 27 60 

GGAAGACTAG ATCAGAGAAT AGTCACACTT CATTATAACA CAT AAT AT AA GGATAGATAA 2820 

AT AAAAAC GC ATCTCTGTTT TAAAAACGAA AAAATCGAAA AAGCTTCTCT CTTTTCCATA 288 0 

ATTTTCTACT CAAATTGTGG TACAATTAAG AGTAAGATTT TAAGTTAGAA ATG AG AC TG A 2 94 0 
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TTTGTATGAG AAAATTTAAC AGCCATTCGA TTCCGATTCG GCTTAATTTA TTGTTTTCAA 3000 

TCGTCATTTT ACTCTTTATG ACCATTATTG GTCGTTTGTT GTATATGCAG GTTTTGAACA 3060 

AGGATTTTTA CGAAAAAAAG CTAGCTTCAG CTAGTCAGAC CAAGATTACA AGCAGTTCAG 312 0 

CCCGTGGGGA AATTTATGAT GCTAGTGGAA AACCTTTGGT AGAAAATACG TTAAAGCAGG 3180 

TTGTTTCCTT TACGCGTAGC AATAAAATGA CGGCTACAGA CTTAAAAGAA ACAGCTAAAA 3 2 40 

AGTTACTGAC TTATGTGAGC ATCAGTTCTC CAAATTTGAC AG AACGC C AG CTGGCGGATT 3300 

ACTATTTGGC TGATCCTGAA ATCTATAAAA AAATAGTGGA AGCTCTCCCA AGTGAGAAAC 3 3 60 

GCTTGGATTC AGATGGCAAT CGTCTATCCG AATCAGAACT GTATAACAAT GCGGTCGATA 3420 

GTGTACAAAC GAGTCAACTA AACTATACAG AGGATGAAAA GAAAGAAATC TATCTTTTTA 34 80 

GTCAGTTAAA TGCTGTTGGA AACTTTGCGA CAGGAACCAT TGCGACAGAT CCTCTAAATG 354 0 

ATTCTCAGGT GGCTGTTATT GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA 3600 

CTTCTTGGGA T AG AAAGGT T TTGGAAACTT CCCTTTCTTC TATAGTTGGG AGTGTATCCA 3 6 60 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAAGCCTA TCTTAAAAAA GGCTATTCTC 3 720 

TAAATGACCG TGTAGGAACC TCCTATTTGG AAAAGCAATA TGAAGAGACC TTACAAGGAA 3 7 80 

AACGCTCGGT AAAAGAAATC CATCTGGATA AATATGGCAA TATGGAAAGC GTGGATACAA 3 84 0 

TTGAGGAAGG TAGTAAGGGA AACAATATCA AACTGACCAT TGATTTGGCT TTCCAAGATA 3 900 

GCGTGGATGC TTTACTGAAA AGTTATTTCA ATTCTGAGCT AGAAAATGGT GGAGCCAAGT 3 960 

ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCGGTT TTGTCTATGT 402 0 

CAGGGATTAA ACATGACTTG AAAACGGGAG AGTTGACGCC TGATTCCTTG GGAACGGTAA 4 080 

CCAATGTCTT TGTTCCAGGT TCGGTTGTCA AGGCGGCGAC CATCAGCTCA GGTTGGGAAA 4140 

ATGGAGTCTT GTCAGGAAAC CAGACCTTGA CAGACCAGTC CATTGTCTTC C AAGGTT C AG 42 00 

CTCCCATCAA TT CTTGGT AT ACTCAGGCTT ACGGTTCATT CCCTATCACA GCGGTCCAAG 42 60 

CTCTGGAGTA TTCATCAAAT ACCTATATGG TCCAAACAGC CTTAGGTCTT ATGGGGCAAA 4320 

CCTATCAACC CAATATGTTT GTCGGCACCA G C AATC TAG A GTCTGCTATG GAGAAACTGC 4380 

GTTCAACCTT TGGCGAATAT GGCTTGGGTA CTGCGACAGG AATTGACCTA CCAGATGAAT 4440 

CTACTGGATT TGTT CCC AAA GAGT AT AG CT TTGCTAATTA CAT TACT AAT GCCTTTGGGC 4500 

AGTTTGATAA CTATACGCCG ATGCAGTTGG CTCAGTATGT AGCAACTATT GCAAATAAT G 45 60 

GTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT AAGGGAGGAC 4 620 

TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA TAAGGTCAAT ATATCCGACT 4680 

CCGATATGAG CATCTTGCAC CAAGGTTTTT ATCAGGTTGC CCATGGTACT AGTGGATTGA 4740 
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C AACTGG AC G TGCCTTTTCA AATGGTGCCT TGGTATCCAT TAGCGGAAAA ACAGGTACAG 4 8 00 

CCGAAAGCTA TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGGTG GCCTATGCCC 48 60 

CATCTGATAA TCCCCAAATC GCTGTCGCAG TGGTCTTTCC TCATAATACC AATCTAACAA 492 0 

ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT GTATCAAAAA TACCATCCAA 4980 

TGAATTAGAA AGGAAATTAT GCTTTATCCA AC AC CT ATT G C C AAGTTG AT TGACAGTTAT 504 0 

TCTAAGTTAC CAGGTATCGG GATTAAGACG GCTACGCGTC TGGCCTTTTA TACGATTGGG 5100 

ATGTCTGCTG ATGATGTCAA TGAATTTGCA AAAAATCTCC TTTCTGCTAA GAGAGAATTG 5160 

AC AT AT T GTT CTATTTGTGG ACGTTTGACA GACGACGATC CTTGTTCTAT CTGTACTGAT 52 2 0 

CCGACTCGTG ACCAGACAAC AATTTTAGTT CTTGAGGATA G T AG AG ATGT GGCAGCCATG 52 80 

GAAAATATCC AAGAATACCA TGGACTCTAT CATGTCCTTC ATGGCCTCAT TTCTCCTATG 5340 

AATGGTATCA GTCCGGACGA TATCAATCTC AAGAGCCTTA TGACTCGTCT TATGGATAGT 54 00 

GAGGTTTCAG AAGTGATTGT GGCGACTAAT GCTACAGCGG ATGGTGAAGC GACTTCCATG 54 60 

TATCTTTCAC GTTTGCTCAA GCCGGCTGGT AT C AAGGTT A CGCGTCTAGC ACGAGGTCTC 5 520 

GCTGTGGGAG CGGACATTGA GTATGCGGAC GAAGTGACAC TCTTACGAGC CATTGAAAAT 55 80 

CGGACAGAGT TGTAAGTGTA GGCAAATTTA CGAACTCCAT TCATTTATAA AAAATCAAAG 5 64 0 

AGGCTGAAAA TCGTTCCTAT CGGCCTCTTT TTGTATAGTG TGATGAGTAG GCTCAGGTTC 57 00 

AAGTTTTAAA AAACCAAGCA AATATGATAT ACTAAAGAGC GAGTATTCTA GTAGAATTAG 57 60 

GACAAATAAT ATGAAACAAA CGATTATTCT TTTATATGGT GGACGGAGTG CGGAACGCGA 5 82 0 

AGTCTCTGTC CTTTCAGCTG AGAGTGTCAT GCGTGCGGTC G AT T AC G AC C GTTTCACAGT 5 8 80 

CAAGACTTTC TTTATCAGTC AGTCAGGTGA CTTTATCAAA ACACAGGAAT TTAGTCATGC 5940 

TCCGGGGCAA GAAGACCGTC TCATGACCAA TG AAAC CAT T GATTGGGATA AGAAAGTTGC 6000 

ACCAAGTGCT ATCTACGAAG AAGGTGCAGT GGTCTTTCCA GTCCTTCACG GGCCAATGGG 6060 

AGAAGATGGC TCTGTTCAAG GATTCTTGGA AGTTTTGAAA ATGCCTTACG TTGGTTGCAA 612 0 

CATTTTGTCA TCAAGTCTTG CCATGGATAA AATCACGACT AAGCGTGTTC TGGAATCTGC 6180 

TGGTATTGCC CAAGTTCCTT ATGTGGCTAT CGTTGAAGGC GATGATGTGA CTGCTAAAAT 6240 

CGCTGAAGTG GAAGAAAAAT TGGCTTATCC AGTCTTCACT AAGCCGTCAA ACATGGGGTC 63 00 

TAGTGTCGGT ATTTCTAAGT CTGAAAACCA AG AAGAACT C CGTCAAGCCT TAAAACTTGC 6360 

CTTCCGATAT GACAGCCGTG TCTTGGTTGA GCAAGGAGTG AATGC 6405 
(2) INFORMATION FOR SEQ ID NO : 108: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

CGAGCTCGGG TACCGGGATT TTAAGGAGTT TGATATGTAT AACCTATTAT TAACCATTTT 60 

ATTAGTATTA TCTGTTGTGA TTGTGATTGC AATTTTCATG CAACCAACCA AAAACCAATC 12 0 

CAGCAATGTA TTTGATGCCA GTTCAGGTGA TTTGTTTGAA CGCAGTAAAG CTCGCGGTTT 18 0 

TGAAGCTGTA ATGCAGCGTT TGACAGGGAT TTTAGTCTTT TTCTGGCTAG CCATTGCCTT 2 40 

AGCATTGACG GTATTATCAA GTAGATAAGA AAATAATGGG CAGGACTAGG TCTTTGCCTC 3 00 

TTTTTATTTT TAAAGGATGT TTGAGAAGGT TTTACAGTAA AAGAAAATTA AAAAAT CT AG 3 60 

AAAGAAAATA T G AAAG AT AG AATAAAAGAA TATTTACAAG ACAAGGGAAA GGTGACTGTT 42 0 

AATGATTTGG CTCAGGCTTT GGGAAAAGAC AGTTCCAAGG ATTTTCGTGA GTTGATTAAA 48 0 

ACCTTGTCCT TAATGGAAAG AAAGCACCAA ATTCGTTTTG AAGAAGATGG TAGTCTGACA 540 

TTAGAAATTA AGAAAAAACA TGAGATTACC CTCAAGGGGA TTTTTCATGC CCAT AAAAAT 600 

GGCTTTGGCT TTGTTAGTCT GGAAGGCGAG GAGGACGACC TTTTTGTAGG GAAAAATGAT 660 

GTCAACTATG CTATTGATGG TGATACCGTC GAGGTAGTGA TTAAGAAAGT CGCTGACCGC 720 

AATAAGGGAA CAGCAGCAGA AGCCAAAATT ATTGATATCC TAGAACACAG TTTGACAACA 780 

GTTGTCGGGC AAATCGTTCT GGATCAGGAA AAACCTAAGT ATGCTGGCTA TATTCGTTCA 84 0 

AAAAAT C AG A AAATCAGTCA ACCGATTTAT GTTAAGAAAC CAGCCCTAAA AT T AG AAGG A 90 0 

AC AG AAGT TC TCAAAGTCTT TAT C GAT AAA TACCCAAGCA AG AAAC AT G A TTTCTTTGTC 960 

GCGAGTGTTC TCGATGTAGT GGGACACTCA ACGGATGTCG GAATTGATGT TCTTGAGGTC 102 0 

TTGGAATCAA TGGACATTGT ATCCGAGTTT CCAGAAGCTG TTGTTAAGGA AG C AG AAAGT 1080 

GTGCCTGATG CTCCGTCTCA AAAGGATATG GAAGGTCGTC TGGATCTAAG AGATGAAATT 1140 

ACCTTTACCA TTGACGGTGC GGATGCCAAG GACTTGGACG ATGCAGTGCA TAT C AAGG CT 1200 

CTGAAAAATG GCAATCTGGA GTTTGGGGTT CACATCGCAG ATGTTTCTTA TTATGTGACC 12 6 0 

GAGGGGTCTG CCCTTGACAA GGAAGCCCTT AACCGTG CG A CTTCTGTTTA CGTGACAGAC 132 0 

CGAGTGGTGC CAATGCTTCC AGAACGACTA TCAAATGGCA TCTGCTCTCT CAATCCCCAA 13 80 

GTTGACCGCC TGACCCAGTC TGCTATTATG GAGATTGATA AACATGGTCG TGTGGTCAAC 1440 

TATACCATTA CACAAACAGT TATCAAGACC AGTTTTCGTA TGACCTATAG CGATGTCAAT 1500 
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G AT AT CCT AG CTGGCGATGA AGAAAAGAGA AAAGAATATC ATAAAATTGT AT C AAGT AT C 1560 

GAACTCATGG CCAAGCTTCA TGAAACTTTA GAAAACATGC GTGTGAAACG TGGAGCTCTC 162 0 

AATTTTGATA CCAATGAAGC GAAGATTTTA GTGGATAAAC AAGGTAAGCC TGTTGATATC 1680 

GTTCTTCGGC AGCGTGGTAT TGCCGAGCGG ATGATTGAGT CTTTTATGTT GATGGCTAAT 1740 

GAAACAGTTG CCGAACATTT CAGCAAGTTG GATTTGCCTT TTATCTATCG AATTCACGAG 18 00 

GAGCCTAAGG CTGAAAAGGT TCAGAAGTTT ATTG AT T AT G CTTCGAGTTT TGGCTTGCGC 18 60 

ATTTATGGAA CTGCCAGTGA GATTAGTCAG GAGGCACTTC AAG AC AT C AT GCGTGCTGTT 192 0 

GAGGGAGAAC CTTATGCAGA TGTATTGTCC ATGATGCTTC TTCGCTCTAT GC AG CAGGCT 1980 

CGTTATTCGG AGCACAATCA CGGCCACTAT GGACTAGCTG CTGACTATTA TACTCACTTT 2 040 

ACCAGTCCAA TTCGTCGTTA TCCAGACCTT CTTGTTCACC GT AT G ATT CG GGATTACGGC 2100 

CGTTCTAAGG AAATAGCAGA GCATTTTGAA C AAGT GAT T C CAGAGATTGC GACCCAGTCT 2160 

TCCAACCGTG AACGTCGTGC C AT AG AAGC T GAGCGTGAAG TCGAAGCCAT GAAAAAGGCT 222 0 

GAGTATATGG AAGAATACGT GGGTGAAGAG TATGATGCAG TTGTATCAAG TATTGTCAAA 22 8 0 

TTCGGTCTCT TTGTCGAATT GCCAAACACA GTTGAAGGCT TGATTCACAT CACTAATCTG 23 40 

CCTGAATTTT AT CAT T T C AA TGAGCGTGAT TTGACTCTTC GTGGAGAAAA ATCAGGTATC 2 400 

ACTTTCCGAG TGGGTCAGCA GATCCGTATC CGTGTTGAAA GAGCGGATAA AATGACTGGA 2 4 60 

GAGATTGATT TTTCATTCGT ACCTAGTGAG TTTGATGTGA TTGAAAAAGG CTTGAAACAG 2520 

TCTAGTCGTA GTGGCAGAGG GCGTGATTCA AATCGTCGTT CGGATAAGAA GGAAGACAAG 2 580 

AGAAAATCAG GACGCTCAAA TGATAAGCGT AAGCATTCAC AAAAAGACAA GAAGAAAAAA 2 640 

GGAAAGAAAC CTTTTTACAA GGAAGTAGCT AAGAAAGGAG CCAAGCATGG CAAAGGGCGA 2 7 00 

GGGAAAGGTC GTCGCACAAA ATAAAAAGGC ACGCCACGAC TATACAATCG TAGATACGCT 27 60 

AGAGGCAGGG ATGGTCCTGA CTGGAACTGA AATCAAGAGT GTACGAGCTG CTCGAATTAA 2 82 0 

TCTCAAGGAT GGCTTTGCTC AAGTGAAAAA TGGAGAAGTT TGGCTGAGCA ATGTTCATAT 2880 

CGCGCCTTAC GAAGAGGGCA ATATCTGGAA CCAGGAACCA GAACGTCGTC GTAAACTCCT 2 94 0 

GC TCC AT AAA AAGCAAATTC AAAAATTGGA ACAAGAGATC AAAGGGACAG GAATGACCTT 3 000 

AGTTCCCCTT AAGGTC T ATA TAAAAGATGG CTACGCTAAG CTTCTTTTAG GACTTGCCAA 3060 

AGGGAAGCAT GACTATGACA AACGGGAGTC TAT C AAACGT CGTGAGCAAA AT CG AG AT AT 3120 

CGCGCGTGTG ATGAAAG CTG TTAATCAGCG ATAAAAAGAG GAATTGAAAA TGGAAAAATT 3180 

AGTTGCCTAT AAACGCATGC CTTTGTGGAA TAAACAAACA ATGCCTGAAG CTGTTCAGCA 3 24 0 
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AAAGCACAAT ACAAAAGTTG GGACTTGGGG GAAAATTACT GTCTTGAAGG GAGCTCTCAA 3 300 

GTTTATTGAA TTGACAGAAG AAGGGGAAGT TCTAGCTGAA CACCTCTTTG AAGCAGGGGC 3 3 60 

AGACAATCCA ATGGCCCAAC CTCAAGCCTG GCACCGAGTG GAAGCTGCCA CAGATGATGT 342 0 

GGAATGGTAC TTGGAATTTT ATTGTAAACC T G AGG ATT AT TTTGCTAAAA AATACAATAC 3 480 

CAATCCTGTT CATTCAGAGG TCCTAGAGGC CATGCAGACA GTGAAACAAG GGAAAGCTTT 3540 

GGATTTGGGT TGTGGTCAGG GGCGTAATTC TCTTTTTCTA GCCCAGCAAG ATTTTGATGT 3 600 

GACGGCTGTA GATCAAAATG GACTAGCTCT TGAAATCTTG CAAAGCATTG TGGAGCAGGA 3 6 60 

AGATTTGGAC ATGCCTGTTG GCCTTTACGA TATCAATTCA GCTAGCATTG AACAAGAATA 3720 

TGATTTTATC GTTTCAACAG TTGTTCTCAT GTTTCTACAA GCGGACCGCA TTCCAGCTAT 3780 

TATTCAAAAT ATGCAGGAGA AAACCAGTGT TGGTGGTTAC AACCTTATCG TTTGTGCCAT 3840 

GGACACGGAG GATTATCCTT GCTCGGTTAA CTTCCCATTC ACCTTTAAAG AAGGAGAACT 3 900 

GGCAGACTAT TACAAGGATT GGGAATTGGT TAAGTACAAT GAAAATCCAG GCCATTTGCA 3960 

CCGTCGCGAT GAGAATGGCA AT CGT AT TC A ACTACGCTTT GCGACCTTAC TAGCTAAGAA 402 0 

AATCAAGTAA ACACACATGA AG AT T AGG AA TTTTCCTGAT CTTTTTTCTT TTTTACGAAT 4080 

GATATAGAAA AGGAGGGAAT TCATGTTTGT TGCGAGAGAT GCTAGGGGAG AATTGGTAAA 4140 

TGTGTTAGAG GAT AAACTT G AGAAGCAAGC ATACACCTGC CCAGCTTGTG GAGGCCAGCT 42 00 

CCATTTGCGT CAAGGACCAA GTGTACGGAC GCATTTTGCC CATAAATCCT TAAAAGACTG 42 60 

TGATTTTTTC TTTGAAAATG AAAGTCCAGA ACACCTGGCC AATAAGGAAT CCCTCTATCA 43 2 0 

CTGGTTGAAA AAAGAGACAA AGGTTCAATT AGAGTACCCG CTTTCAGAAC TTAAACAGAT 4 380 

TGCGGATGTA TTTGTAAATG GCAATCTAGC TCTAGAAGTT CAGTGTAGTC CCTTGCCTCA 444 0 

GAAAGTCCTT AAAGAGCGAA GTGAGGGCTA TCGTAGTCAG GGTTACCAAG TACTGTGGTT 4 500 

GCTGGGTCAA AAACTGTGGC TCAAGGAGCG TTTGACTCGT CT AC AG CAAG GTTTTCTTTA 4 560 

TTTCAGTCAA AACATGGGCT TTTATGTTTG GGAATTAGAC AAGGAAAAAC AAGTTTTAAG 462 0 

ACTCAAATAC CTGATTTACC AGGATCTCCG CGGTAAACTC CAT TAT C AAA TCAAGGAATT 4680 

TTCCTATGGT CAAGGTAGTT TAT TGGAAAT ATTGCGTCTT CCCTATAAGA GACAAAAAAT 4740 

ATCTCATTTT ACAGTTTCTG AGGACAAGGA CATCTGTCGC TATATCCGGC AACAACTTTA 4800 

TTATCAAAAT CTCTTTTGGA TGAAAGAACA AGCAGAAGCC TATCAAAAGG GAGAAAATAT 48 60 

CCTGACTTAT GGACTGAAAG AATGGTATCC ACAAATTCGA CCAATAGTGG GCAAATTTTT 492 0 

CCAGATTGAA CAAGACTTGA CTAGCTATTA TCAGCACTTT TATACCTATT ACCAAAAAAA 4 9 80 

TCCTCAAAAT GAT TGGCAAA AGCTTTATCC ACCAGCCTTT TATCAGCAAT ATTTCTTGAA 5040 
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AAATATGGTA GAATAGAAAG GATGGAGGAA TCTAATGGTA TTACAAAGAA ATGAAATAAA 5100 

TGAAAAAGAT ACATGGGATC TATCAACGAT CTACCCAACT GACCAGGCTT GGGAAGAAGC 5160 

CTTAAAAGAT TTAACAGAAC AATTGGAGAC AGTAGCCCAG TATGAAGGCC ATCTCTTGGA 5220 

TAGTGCGGAT AACCTACTAG AAATCACTGA ATTTTCTCTT GAAATGGAAC G C C AG AT AG A 52 80 

GAAGCTTTAC GCTTATGCTC ATATGAAGAA TGACCAGGAT ACACGTGAAG CTAAGTATCA 5340 

AGAGTACTAT GCCAAGGCCA TGACACTCTA CAGCCAGTTA GACCAAGCCT TTTCATTCTA 5400 

TGAGCCTGAA TTTATGGAGA TTAGCGAAAA GCAGTATGCT GACTTTTTAG AAGCTCAACC 5460 

AAAGCTGCAG GTTTATCAAC ACTATTTTGA CAAGCTTTTG CAAGGCAAGG AT C ACGTTC T 552 0 

TTCACAACGT GAAGAAGAAT TATTGGCTGG AGCTGGAGAA ATCTTTGGTT CAGCAAGTGA 5580 

AACCTTCGCT ATCTTGGACA ATGCGGATAT TGTGTTCCCT TATGTCCTAG ACGATGATGG 5640 

TAAAGAAGTT CAGCTATCTC ATGGGACTTA CACACGTTTG ATGGAGTCTA AAAAACGTGA 57 00 

GGTTCGCCGT GGTGCCTATC AAGCTCTTTA TGCGACTTAC GAACAATTCC AACACACCTA 57 60 

TGCCAAAACC TTGCAAACCA ATGTTAAGGT GCAAAATTAC CGTGCTAAAG TTCGTAACTA 5 82 0 

CAAGAGTGCT CGTCATGCAG CCCTCGCAGC GAATTTTGTT CCAGAAAGTG TTTATGACAA 5880 

TTTGGTAGCA GCAGTTCGCA AGCATTTGCC ACTCTTACAT CGCTATCTTG AGCTTCGTTC 594 0 

AAAAATCTTG GGGATTTCAG ATCTCAAGAT GTACGATGTC TACACACCGC TTTCATCTGT 6000 

TGAATACAGT TTTACCTACC AAGAAGCCTT GAAAAAAGCA GAAGATGCTT TGGCAGTCTT 6 060 

GGGTGAGGAT TACTTGAGCC GTGTTAAACG TGCCTTCAGC GAGCGTTGGA TTGATGTTTA 612 0 

CGAAAATCAA GGCAAGCGTT CAGGTGCCTA CTCTGGTGGT TCTTATGATA CCAATGCCTT 6180 

TATGCTTCTC AACTGGCAAG ACAATCTGGA CAATCTCTTT ACTCTTGTTC ATGAAACAGG 624 0 

T C AC AGTATG CATTCAAGCT ATACTCGTGA AACTCAGCCT TATGTTTACG GGGATTACTC 63 00 

TATCTTTTTG GCTGAGATTG CCTCAACTAC CAATGAAAAT ATCTTGACGG AGAAATTATT 63 60 

GGAAGAAGTG GAAGACGACG CAACACGCTT TGCTATTCTC AATAACTTCC TAGATGGTTT 64 2 0 

CCGTGGAACA GTTTTCCGCC AAACTCAATT TGCTGAGTTT GAACACGCCA TTCACCAAGC 64 80 

AGATCAAAAT GGGGAGGTCT TGACAAGCGA TTTCCTAAAT AAACTCTACG CAGACTTGAA 654 0 

C C AAGAGT AT TATGGTTTGA GTAAGGAAGA CAATCCTGAA ATCCAATACG AGTGGGCTCG 6 6 00 

CATTCCACAC TTCTACTATA ACTACTATGT ATATCAATAT TCAACTGGCT TTGCGGCCGC 6 660 

CTCAGCCTTG GCTGAAAAAA TTGTCCATGG TAGTCAAGAA GACCGTGACC GCTATATCGA 672 0 

CTACCTCAAG GCAGGTAAGT CGGACTATCC ACTTAATGTC ATGAGAAAAG CTGGTGTTGA 67 80 
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TATGGAGAAG GAAGACTACC TCAACGATGC CTTTGCAGTC TTTGAACGCC GTTTAAATGA 684 0 

GTTTGAAGCC CTTGTTGAAA AATTAGGATT GGCATAAAAT GGTTGAATCG TATAGTAAGA 6900 

ATGCTAACCA TAACATGCGT CGTCCTGTCG TCAAAGAAGA AATTGTAGAC TTGATGCGTC 6960 

AGCGTCAAAA GCAGGTCACA GGTTTCTTGA AAGAATTGGA AGACTTTGCC C GC AAGG AAA 7 02 0 

ATATTCCTAT TATTCCCCAT GAAACGGTTG CTTATTTCCG TTTTCTTATG GAAACCATGC 7 080 

AGCCTAAAAA TATTCTGGAA ATTGGGACGG CTATCGGTTT TTCAGCTCTC TTGATGGCTG 714 0 

AACATGCGCC AAATGCTAAG ATTACAACTA TTGATCGTAA TCCAGAAATG ATTGGTTTTG 72 00 

CCAAGGAAAA TTTTGCCCAG TTTGACAGTC GCAAGCAAAT CACTCTCCTA GAGGGAGATG 72 60 

CGGTGGATGT CTTATCTACA C TG AC AG AG T CTTATGATTT CGTCTTTATG GATTCTGCCA 73 2 0 

AGTCTAAATA CATCGTCTTT CTGCCAGAAA TCGTCAAACA TTTGGAAGTT GGTGGTGTGG 7 380 

TTGTCTTGGA TGATATTTTT CAAGGTGGTG ATGTTGCCAA GGATATTATG GAAGTCCGTC 7440 

GTGGTCAGCG AACCATTTAT CGAGGCCTTC AAAAAT T ATT TGATGCAACC TTAGACAATC 7 500 

CAGAACTCAC CGCAACATTA GTGCCTTTAG GAGATGGTAT TCTCATGCTT CGTAAAAATG 7 5 60 

TAGCAGATGT TCAACTGTCT GAAAGCGAAT GATTTTCAGA AAAATTTAAG AAAAAATAGT 762 0 

AAAATAGATA GAGTAACACT TATCTCAAAG GAGTAGACAT GAAGAAAAAA TTATTGGCAG 7 680 

GTGCCATCAC ACTATTATCA GTAGCAACTT TAGCAGCTTG TTCGAAAGGG TCAGAAGGTG 774 0 

CAGACCTTAT CAGCATGAAA GGGGATGTCA TTACAGAACA TCAATTTTAT GAG C AAGTG A 7 800 

AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TAAATATGAC CAT C C AAAAA GTTTTTGAAA 7 860 

AACAATATGG CTCAGAGCTT GATGATAAAG AGGTTGATGA T ACT ATTG C C GAAGAAAAAA 792 0 

AACAATATGG CGAAAACTAC CAACGTGTCT TGTCACAAGC AGGTATGACT CTTGAAACAC 7980 

GTAAAGCTCA AATTCGTACA AGT AAAT TAG TTGAGTTGGC AGTTAAGAAG GTAGCAGAAG 8 040 

CTGAATTGAC AGATGAAGCC TATAAGAAAG CCTTTGATGA GTACACTCCA GATGTAACGG 8100 

CTCAAATCAT CCGTCTTAAT AATGAAGATA AGGCCAAAGA AGTTCTCGAA AAAGCCAAGG 8160 

CAGAAGGTGC TGATTTTGCT CAATTAGCCA AAGATAATTC AACTGATGAA AAAACAAAAG 8220 

AAAATGGTGG AG AAATT AC C TTTGATTCTG CTTCAACAGA AGT AC C T GAG CAAGTCAAAA 82 80 

AAGCCGCTTT CGCTTTAGAT GTGGATGGTG TTTCTGATGT GATTACAGCA ACTGGCACAC 8340 

AAGCCTACAG TAG C C AAT AT TACATTGTAA AACTCACTAA GAAAACAGAA AAATCATCTA 8400 

AT ATTG AT G A C T AC AAAGAA AAATTAAAAA CTGTTATCTT G ACT C AAAAA CAAAATGATT 84 60 

CAACATTTGT TCAAAGCATT ATCGGAAAAG AATTGCAAGC AGCCAATATC AAGGTTAAGG 852 0 

ACCAAGCCTT CCAAAATATC TTTACCCAAT ATATCGGTGG TGGAGAT T C A AGCTCAAGCA 85 80 
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GTAGTACATC AAACGAATAG TCCAAATCAA TGAGTCAGGG AAAAAACTCG ACTTCAGGAA 8640 

AAAATGAAGC AAACATTCCC ACAATAAAAC G C AT AGT AC A AGGTTTGTAC TGCCCCCCAA 8700 

AAAGT TAG AC AATTAATTTA TCCGAAGGAT TTAGTTCTGT AT TGC AC AG A GCTAAGTCCT 87 60 

TTTAGTTTTA TCTTAATTCT CTTATTGTTG TAATAATCAA TATAGTCTAT AATGGCTCGT 8 82 0 

TCCAATTGAT TAAGTGATTT AAATGTTTTC TCATAGCCAT AAAACATTTC GGATTTTAAA 8880 

ATGCCAAAGA AAGATTCCAT CCTACCGTTG TCTTGGCTGT TGCCCTTACG TGACATGGAT 8 940 

GCTTGAATTC CCTTACTCTC TAGGAAGCGA TGATAAGAAT CGTGTTGATA TTGCCAGCCT 9000 

TGGTCACTAT GGAGAATCGT ATTCTCGTAG TGCTTCTCTT TGAATGCCTG TTCCAACATT 9060 

AACGATCAAT CAATTTAATC ATGTACCTAA GATTAGAATT GTTTATCCCA AATTTATTTG 9120 

AAAGCTTCTC TAAGCTATAT CCTTGTTTTC TAAGTTCATA GATCTGAACT TTATCATCAT 918 0 

AAGTTAATTT CATAATAAAA ACACCCCAAA AGT T AG ATT T TTTCTGTCTA ACTTTTGGGG 92 40 

TGTAGTTCAT GT AC AC CTG A TATGATGCGT TTTATAATTT TAAAGACTTT TTGACCAGCC 9300 

TCATTTTTTT AACT TG AT AC TCAGTGAAAA GCAAAGATTA AACTAGGAAG CTAGCTGTAG 93 60 

GCTGCTCAAA GAACAGCTTT GAGGTTGTAG ATAAAACTTG TGAGGTCACC AACATATATA 942 0 

ATGTGAAGCT GACGTGGTTT GAATAGATTT TAGAAGAGTA TGAGTCTGGA AGTTTTAATG 948 0 

GATAATGCAA GATTCCATAG AATGGGTAAG CTAGAGTTCT TATGTGAAGA GTTTGGGCAT 9540 

AAACTTTTAC CTTTTCCTCC CTACTCATCT TAGTATAGAA AAGTGAATCT GAAATAGTAC 9600 

ATAACTGCTT CTAAAACATT CTTATAAATT GATTTAAATT CTCAAATCAT ATTATTCAGT 9660 

TCTTATTTCA TTTTGTTCTA CAATCCTGTT GAGAAGACAC GTGTTCATAT CAAAAAGGTA 972 0 

TTGGCAAGTT GCAATACCTT TTTACGAGGC TCTGTTGTCT TATTTTTGTT TCAACTGACT 9 78 0 

ATATCTCCTA TGGTTCTAGT TCAGAAGGCT AGGCTATAAT TATGATTGAT AAGAAGTATC 9840 

ATTCCAAGTA TTGGGAGTGA ATGT TT C AAA ATCATGGGTT TCTATAATGG TCAGGCTGGC 9 900 

AT T TGC TAG A CCGCCATCTT TACGAAGAAG TGGTTCTTTA TAGCCTAGGA GAGTACGAAG 9 9 60 

ACTGGCAGTA AGATTGGCGC CGTGTCCGAC AATTAGAATA CGTTCAGCTG GACTATCTTT 10020 

TAATGATTTG ATAAATTGGA TGGTCCGTTG AGTTGTACTA TAGAGGGATT CGGCTCCGAA 100 80 

CATTCGAGTG TCAAATTGAG CAAGATTTGA ACGAAAAGCC TGGATTTGTT GCGGGTAAAT 1014 0 

AGCTTCCAAG GTTGCAATTT TCAAACCTTC TAACTTCCCA AG T TGC C ATT CACGGAGATT 10200 

AGGAACGATT TCTAAAGAAC AGGGGGTATA G AGT T G AC T T TGGATAATCT C AG C AG ATT T 102 60 

GACCGCTCGA GGTAAATCAC TTGAATAAAT CTGATCAAAA GGAATTTCCT TGAGATACTG 10320 
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ACCAAGTCGT TTTAGGGTTT CAATGGATTC AGGAAGAAGA GGAGAATCAC CACTAGCACC 10380 

TTGAAAACGA CCTTCTTGGT TCCAGAGGGT ACGACCGTGG CGGACAAAGT AGAGTTTCAT 10440 

TACTTGATGT CCTCCAAAAT AT C T AC AAAG TCTGCCTTTA CAAAGCTAGC CAAGTCTTGT 1050 0 

GGCGCGACGA TAATGCTGTG TCCGACTTCG C CTG C AG AG A CAATCATTTG ATCCAAATCT 10560 

AGAGCAATTT TATCGATAAA AATGGGATAA TTGTGTTTCT GACGAATTCC GACAGGATTA 10 62 0 

TTGGCTCCAT GAATGTAACC AGTTGTTTTT TCTAAGTCCT TTTGTGGAAT CATGCTCACT 10680 

TTTTTATTGC CAGAAATTTT AGCTAGTTTC TTTTCAGACA AGTGCTGAGT GATAGGGACA 10740 

ATTCCGATAA TCGGTCCGGT CTTGTCTCCC AAAAGCGCCA AGGTTTTGAA AATCTGATCT 10800 

CGTTCATAAC CTTGAGGAAG CTCTCCTTCT AGGGCATTGA TTTGAATCCC CTGATGAGGG 10 8 60 

ATAGCTGCTT TAGATAGGAT TTGTTCCACC AATGTTTTTT TGATTTTAAC TTTTTTTGCC 10920 

ATT ATT TAT A TTTATCCTCC AATTGACTCA TCCAAATACC AAGCCAGATT CCCAGCGCAA 10980 

AGAAGAAGGC GATGATGACA TAACCGACAA GTGAAAGTCC TGTGTATTGG ATACTTTCAG 11040 

CGTTTCCTGC ATTTGGAATT AAGATCAAAA GGGTACTTGA TAGGACGATA CCGATGATGA 11100 

AATGATAGAC GAACTGTTTA CGGAGTTCTT CTAGTTCTCC GTCCGTCCAA GCGTAGGCCA 11160 

CTTCTTCTTT CTTGCCTTTA CCTTTGGACA TCTTGTAAAG AGGTGGGAGG G C AAT AT AG A 1122 0 

CATGACCTGC CTCGACTAGC GGACGCATGT AACGGTAGAA AAATGTCAAG AGCAAGGTCT 112 8 0 

GGATATGGGC ACCGTCGGTA TCCGCATCG 113 0 9 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 8 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 109: 

CCATAGTCTA ACAAGTCTTT GTAAAGGTTT AT CC CTG ATT CATGTAAAGA TTGTGTAAAG 60 

AATCAAAAAA AGCCACTTTT GAAAAATGGC TGCTCCTAAA AATAGCTTTA AAAATTATTA 12 0 

GTCCTGTGCG AAAGATTGGT TAGGAAGAAA AATCGTGAAG CAACTGCCTC TGCCAAGCTG 180 

ACTCGTCACC GTGACTTGGC C AC C T AAT AA TTGACTGAGT TCTTTGACAA TGGCAAGGCC 24 0 

AAGACCAGTG CCACCAGTTT GTCTGCTTCG ACCTTTATTA ACTCGGTAAA AACGTTCAAA 3 00 

AATACGATCC TGCTCTAATT GACTAATACC AATCCCTGTA T CTG AT AC AG AAAT CTT AAT 3 60 

GCCTTCGTTC ACCTTTTGGG TCTTGACCTC AATTTTTCCC CCTTGTTCAG TGTAACGGAT 420 
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GGCATTGGAT AAAAGATTGA GTAAGATTTG GGAAAGTAAT TGACTATCTG ATACGAGGGT 480 

G AC AT CAT C T GGCACCTGCA CCTTTAGCTG TAAATCCTTC TTCTTGAGCT GAGGTTGCAA 540 

GCTTTGAGTC AAATCCTGTA CAAATTCTGC CAAAGAAAGG GTCGTCCATT GTATAGGCAT 600 

TTGTTGAGCC TTAGATAAGG TAAGAAGATG CTCAACAATA TGCTCAAGAC GCAAACTTTC 660 

TTTGTAAATA ATGTCTAGAA AGTCATCCTT GAGCGCTTCT TCTTCAGCTG ACATCCCCTT 720 

AATGGTTTCA GCAAAGCCCT TAATCGAAGT AACTGGTGTC CTCAATTCAT GGGAGGCATT 780 

TGAGACAAAG GCTAAATTTA ACTTTTCATA AGTTCTAATC GTTGTTAAAT CATATAGCAA 840 

GACGAGCACA GCTTCCACAG ATTGGGTGGG GCTAAAAACG GGAACTGCTG TCACTTCTAA 9 00 

AATCAAGTCA CCCTCATGAA ACCCACTTAC TTCTTGTTTT AACCTTGTTT TTTGATCAAA 960 

GGCTTGGTGA ACTAAATTCC G AAT AT C CAT CCGTTTGAGG TCATCAAGTG AACTTATGTC 102 0 

GCCGTCCACA TCGGGAAAAT AATGAGGCAG AGAGCGACTG GATAATAACA TCTGACCTTG 1080 

AGCGGAAACT AAAAACGTCC CCATGGTTAG GTGCGACAGA AGAACCTCCA TTGTTTCGGC 1140 

TAG AT C C T TG TATTGCTGAT CCTGTTGGGA GACTTTGGTT TTTAGGCCAG AC AC AT AC TG 12 00 

AGCCAAAGAC TTTAAGTCTT CTTGCCCTTT TTCTAAAAAG TAT T C ACT AC TGGTCAAGAG 12 6 0 

AGGTTGGTGC AAGGTCTCAA AAGCAACTTC CCATTTCCAA AGGCAAAAGA GCCAGTAGCC 13 20 

ACCTAGTCCC AAAGAAAGGG CTAGAAGAAA GAGACCGATG CCTTTACTGA TCCAAGTTAA 13 80 

TGCCATCCCT GC AAT C AG AA TGAGGCTAAC ACTTAGATTG AC TAG C C AAA ATTGAAGGTA 144 0 

GCGTTTCATC TATAACTCCT TGAACTTATA ACCATAACCC CGAATGGTTC GAATAAATTG 150 0 

AGGGGCTTTA GGATTGTCTT CAATTTTTTC CCTCAACTTA CCAATATGAA CGTCCACCAA 15 60 

ACGTGTTTCC TGCCCAAAGT CATACCCCCA GATACGTTCC AAAAGACGCT CTCTAGTCAG 162 0 

TGTCATGTTG GGATGTTTCA TAAGATAGAG CAAGAGTTCA AATTCTTTTG GGGTCAAACT 168 0 

CAGTAACTTA TTCGCCTTGT AGACTTCATG ACGCTCAGGG TATACTTTCA AGGTCCCAAA 174 0 

TAGCCAAGAA TCGTCAGCGA TATTATCTGA ATCATCTCCT TCTTGTTCTC CTTTAGTTCG 18 00 

CCTGAGGACA GCCTTGACAC GCGCCAGCAA TTCTCTAGGG CTAAAAGGCT TGGTCAGGTA 1860 

GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT TCGCAGAAAC 1920 

CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT CCATGCCATC 198 0 

TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT CTGCCAAAGC 2 040 

TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC TTAAATGGTA 2100 

GTCAAGCAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTTGTT TTGTCATCTA 2160 
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TTATCTCCTA TTGGTAACAT TATAACACAA TTATCAGAAA TCCTAACATT GCTAAATCAG 2220 

ATTAAATTTG CCTATCAAGA CTAGTATCTG GTCAAACGCT CAATCATCTC CTTGTGCTCT 2280 

GGATAGGTCG CCAGTAGATC TACCCTTTCA AATAATTCAA AATCCTCAAA TTCAAAACCA 2 34 0 

GGAGCAACAA GACAAGAAAC CAGAGCATCA TCCTTATCAA CTGTTGATCC CCAAATAGTG 2400 

CCCTTAGGAA CACAGTAGTG AAGTTGTTGC CCTTTGGATA TGTCCAGGCC TAAAGTGACT 2460 

GCTTCGTAGT GACCATCTGC TGTAATCATG TGAACAGTAA GTGGGGATCC TGCATGAAAA 252 0 

T AC C AG ATTT CATCTGCTGT CAATCGGTGA AAATGTGAAG GATTCGTTTC TTCTAATAAG 25 80 

AAATAAATAC TGGTATAAAG CGCCCTTCCC TTACCAGCAA GGTTTATAGT GTCTGAAGCT 2 640 

TTTTTTGTTT GTCTAAAATA GCCACCTTCA ATATGGGGAG CTAACTCTAG AGTTCTTATC 27 00 

AAGTCTTCTT TATCCGTCGG AGCCAATGGG TTGAAGTAAC TCTTGTTCAA AGTGGTTTTA 2 7 60 

CG AT TTCAAG AACTCCTCTC AGTTCTGAGG ACACGGTAAT GATTGATGCG ACGGAAGTAC 282 0 

AAATCAATCG CCCTAAAAAA AGAATTAGCG AATGATTCTG GTAAAAAAAA TGCCACGCTA 2 880 

TGAAGGCTCA AGCGATTGTC ACAAGTCAAG GGAGAATTGT TTCTTTGGAT ATCGCTGTGA 2 940 

ACTATTGTCA TGATATGAAG TTGTTCAAAA TGAGTCGCAG AAATATCGGA CAAGCTGGTA 3 000 

AAATCTTGGC TGACAGTGGT TATCAAGGGC TCATGAAGAT ATATCCTCAA GCACAAACTC 3 060 

CACGTAAATC CAGCAAACTC AAGCCACTAA CAGTTGAAGA TAAAGCCTAT AACCATGCGC 312 0 

TATCCAAGGA GAGAAGCAAG GTTGAGAACA TCTTTGCCAA AGTAAAAACG TTTAAAATGA 3180 

TTTCAACAAC CTATCGAAAT CAT C GT AAAC ACTTCGGATT ACGAATGAAT TTGATTGCTG 324 0 

GCATTATCAA TCATGAACTA GGATTCTAGT TTTGCAGGAA GTCTATTATT TGGTTAGGTG 3 3 00 

AATTAGTGAA GCGTTTAGGC AAGTGTCTCT GGTTACGACG TCATGGACTC TAAATCGATT 33 60 

ATATTTAGGG GTCATGACTA GTGAAG C AGT TAGCTAGTTC GCATATAAGC GGCTAGCGTC 342 0 

TAACAATTAG GAACTTTAGT TCCAATAACT TTAAGATTAC GACGTTTTAG G AC AT AAAT C 3480 

GATCATATTT ATGTCCTAAA ACTAGTGAAG CGCCTAGCCA AAGTCCGAAT AGGATTTGGC 354 0 

GTTAGTTACT TAGATTGCTT TGCAATCAAG TAACTTTGGC G ATT T AC AT C TTCTCTGGCG 3 600 

CTTCTACTCC AAGCAAGCGA AGGGCTTCTT TGAGAACGAC TGCGGTTGCG TAGCTGAGGG 3 6 60 

CTAGACGGCT GTCGCGTTCT GGGCTTTCAT CCAAGATACG TGTATGTGCA TAGTATTTGT 372 0 

TAAAGGATTG AGCCAGGCTA ATTGCAAATT TAGCAATGAT AGAAGGTTCA AAGTTATCTG 3780 

CCGCACGGTT GATAATACGT GGGAAGTCTT GAATGAGTTT AATGATTTCC CAGCTTTCAG 3840 

TATCATTCAA GCTATAGTTG CCAGCTGTTT CTGGTTTGAA ATCGGCTTTG CGTAAGATAG 3900 

ATTGGATACG AGCGTAGGCA TATTGAACGT AAGGTCCAGT TTCACCCTCG AAGG AT AC C A 3 9 60 
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TAGCCTCTAG GTCGAAGTCG TATCCATTTG TACGGTCGGT TTTGAGGTCA TAGAATTTAA 402 0 

TGGCTCCAAT CCCAACAGCA TGTGCTACTT GGTCTTTGTT TTCTAGTTCA GGATTTTTAG 40 80 

CCTCGATTTG GACCTTGGCA CGGCTAACAG CCTCTGCAAC AGTAGGCTCT AGCAAGATGA 4140 

CATTCCCTTT ACG AG TAG AG AGTTTCTTCC CTTCTTTTGT AACCAAACCA AAAGGAACGT 4200 

GAGTAATGTC GTCACTCCAG TCGTAGCCCA TCTCTTGCAA GACAGCTTTG AGCTGTTTAA 4260 

AGTGGGCAGA TTGTTCTTGA CCAACGACAT AGATAGATTT AGCAAATTGG TATTCGTTTT 4320 

TACGGTAGAG GGCTGCAGCC AAGTCACGTG TGATATAGAG AGTTGCACCA TCAGACTTCT 43 80 

TGATGAGGGC TGGATGTTCA ATT C CAT ATT TCTCAAGATT CACAACTTGG GCACCTTCTG 4440 

AT TCAAGAAG TAGTCCTTTT TCAGAAAGAA TGTCTACAAC TGCATCCATC TTATCATTGT 450 0 

AGAAGGCTTC TCCGTTATAG CTGTCAAATT CAACCTTCAA TTCATTGTAA AGGCGGTTAA 4560 

ATT CC ACT AA ACTTTCATCG CGGAACCATT GCCAAAGAGC GAGAGCTTCC TC AT C T C CAT 4 62 0 

TTTCAAGTTT ACGGAACCAT TCGCGCGCTT CTTCATCCAA GCTAGGGTCA TTTTCAGCTT 4680 

CAGCGTTGAT GCGGACATAG AGTTTAAGGA GTTCATCGAT TGGATGAGCT TTTACAGCTT 4740 

CTTCGTCGCC CCATTTTTTG TAGGCAACAA TCAACATCCC AAATTGTTTA CCCCAGTCTC 4 800 

CCAAATGGTT GACCTTGACC GTTTGATAAC CGATTTTTTG GAAAATATGT G AC AAGC T AT 4 8 60 

CTCCGATAAC AGTTGAACGC AGGTGGCCAA TAGAAAATGG TTTAGCGATA TTCGGACTAG 4 92 0 

ACATGTCGAT AACAACATTT TCTTGTTTAC CAATATTTTG GTCAGCATAG TGTTCTTTTT 4980 

CAGTGGTAAC AGCTTGCAAT ACTTGAGCAG AAATGGCAGA TTTATCAAGG AAAAAGTTAA 5040 

CGTAAGGTCC TGTTGCGACA ACTTTTTCAA AGGCTTGGCT GTTCATTTTT TCAGCCAGTT 5100 

CAGCCGCAAT CATTTGTGGT GCTTTACGTT CGACTTTTGC AAGAGAAAAA GCAGGGAAAG 5160 

CAATGTCTCC CATTTCTGAG TTTTTAGGGG TTTCCAGTAA CTTTAAAATA GCCTCTTGGT 522 0 

CCAGGCTATC AATGATGCTA GATAATTCGC TAGCAATCAA TTCTTTTGTA TTCATTAAGA 52 8 0 

GCTCCTTTTT GGACTTTTCT ACTATTTTAT CACAATTTTA AAGAAAGAAG AAAAAATTTT 53 4 0 

TGAAATCTCC TGTTTTTTTG GTATAATATG GTTATAAATA TAGTTATAAA TATGCACGCA 5400 

AGAGGATTTT ATGAGAAAAA GAGAT CGT C A TCAGTTAATA AAAAAAATGA TTACTGAGGA 54 60 

GAAATTAAGT ACACAAAAAG AAATTCAAGA TCGGTTGGAG GCGCACAATG TTTGTGTGAC 5520 

GCAGACAACC TTGTCTCGTG ATTTGCGG 554 8 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3132 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

TACCCGGTAG TCTTAGCAGA CACATCTAGC TCTGAAGATG CTTTAAACAT CTCTGATAAA 60 

GAAAAAGTAG CAGAAAATAA AGAGAAACAT GAAAATATCC ATAGTGCTAT GGAAACTTCA 12 0 

CAGGATTTTA AAGAGAAGAA AACAGCAGTC AT T AAGG AAA AAGAAGTTGT TAGTAAAAAT 180 

CCTGTGATAG ACAATAACAC TAGCAATGAA GAAGCAAAAA T C AAAGAAG A AAATTCCAAT 24 0 

AAATCCCAAG GAGATTATAC GGACTCATTT GTGAATAAAA ACACAGAAAA TCCCAAAAAA 3 00 

GAAGATAAAG TTGTCTATAT TGCTGAATTT AAAGATAAAG AATCTGGAGA AAAAGCAATC 3 60 

AAGGAACTAT CCAGTCTTAA GAATACAAAA GTTTTATATA CTTATGATAG AATTTTTAAC 42 0 

GGTAGTGCCA TAGAAACAAC TCCAGATAAC TTGGACAAAA TTAAACAAAT AGAAGGTATT 480 

TCATCGGTTG AAAGGG C AC A AAAAGTCCAA CCCATGATGA ATCATGCCAG AAAGGAAATT 54 0 

GGAGTTGAGG AAGCTATTGA TTACCTAAAG TCTATCAATG CTCCGTTTGG GAAAAATTTT 6 00 

GATGGTAGAG GTATGGTCAT TTCAAATATC GATACTGGAA CAGATTATAG ACATAAGGCT 6 60 

ATGAGAATCG ATGATGATGC CAAAGCCTCA ATGAGATTTA AAAAAGAAGA CTTAAAAGGC 720 

AC TG AT AAAA ATTATTGGTT GAGTGATAAA ATCCCTCATG CGTTCAATTA TTATAATGGT 7 80 

GGCAAAATCA CTGTAGAAAA ATATGATGAT GGAAGGGATT ATTTTGACCC ACATGGGATG 84 0 

CATATTGCAG GGATTCTTGC TGGAAATGAT ACTGAACAAG ACATCAAAAA CTTTAACGGC 9 00 

ATAGATGGAA TTGCACCTAA TGCACAAATT TTCTCTTACA AAATGTATTC TGACGCAGGA 9 60 

TCTGGGTTTG CGGGTGATGA AACAATGTTT CATGCTATTG AAG AT TCT AT CAAACACAAC 1020 

GTTGATGTTG TTTCGGTATC ATCTGGTTTT ACAGGAACAG GTCTTGTAGG TGAGAAATAT 1080 

TGGCAAGCTA TTCGGGCATT AAGAAAAGCA GGCATTCCAA TGGTTGTCGC TACGGGTAAC 1140 

TATGCGACTT CTGCTTCAAG TTCTTCATGG GATTTAGTAG CAAATAATCA TCTGAAAATG 1200 

ACCGACACTG GAAATGTAAC ACGAACTGCA GCACATGAAG ATGCGATAGC GGTCGCTTCT 12 60 

GCTAAAAATC AAACAGTTGA GTTTGATAAA GTTAACATAG GTGGAGAAAG TTTTAAATAC 132 0 

AGAAATATAG GGGCCTTTTT CGATAAGAGT AAAATCACAA CAAATGAAGA TGGAACAAAA 13 80 

GCTCCTAGTA AATTAAAATT TGTATATATA GGCAAGGGGC AAGACCAAGA TTTGATAGGT 144 0 

TTGGATCTTA GGGGCAAAAT TGCAGTAATG GATAGAATTT ATACAAAGGA TTTAAAAAAT 15 00 

GCTTTTAAAA AAGCTATGGA TAAGGGTGCA CGCGCCATTA TGGTTGTAAA TACTGTAAAT 15 60 
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TACTACAATA 


GAGATAATTG 


G AC AG AG C T T 


CCAGCTATGG 


GATATGAAGC 


GGATGAAGGT 


1620 


ACTAAAAGTC 


AAGTGTTTTC 


AATTTCAGGA 


GATGATGGTG 


TAAAGCTATG 


GAACATGATT 


1680 


AATCCTGATA 


AAAAAACTGA 


AGTCAAAAGA 


AATAATAAAG 


AAGATTTTAA 


AGATAAATTG 


1740 


GAGCAATACT 


ATCCAATTGA 


TATGGAAAGT 


TTTAATTCCA 


ACAAACCGAA 


TGTAGGTGAC 


1800 


GAAAAAGAGA 


TTGACTTTAA 


GTTTGCACCT 


GACACAGACA 


AAGAACTCTA 


TAAAGAAGAT 


1860 


ATCATCGTTC 


CAGCAGGATC 


TACATCTTGG 


GGGCCAAGAA 


TAGATTTACT 


TTTAAAACCC 


1920 


GATGTTTCAG 


CACCTGGTAA 


AAATATTAAA 


TCCACGCTTA 


ATGTTATTAA 


TGGCAAATCA 


1980 


ACTTATGGCT 


ATATGTCAGG 


AACTAGTATG 


GCGACTCCAA 


TCGTGGCAGC 


TTCTACTGTT 


2040 


T TGAT T AGAC 


CGAAATTAAA 


GGAAATGCTT 


GAAAGACCTG 


TATTGAAAAA 


TCTTAAGGGA 


2100 


GATGACAAAA 


TAGATCTTAC 


AAGTCTTACA 


AAAATTGCCC 


TACAAAATAC 


TGCGCGACCT 


2160 


ATGATGGATG 


CAACTTCTTG 


GAAAGAAAAA 


AGTCAATACT 


TTGCATCACC 


TAGACAACAG 


2220 


GGAGCAGGCC 


TAATTAATGT 


GGCCAATGCT 


TTGAGAAATG 


AAGTTGTAGC 


AACTT TC AAA 


2280 


AACACTGATT 


CTAAAGGTTT 


GGTAAACTCA 


TATGGTTCCA 


TTTCTCTTAA 


AGAAATAAAA 


2340 


GGTGATAAAA 


AATACTTTAC 


AATCAAGCTT 


CACAATACAT 


CAAACAGACC 


TTTGACTTTT 


2400 


AAAGTTTCAG 


CATCAGCGAT 


AACTACAGAT 


TCTCTAACTG 


ACAGATTAAA 


ACTTGATGAA 


2460 


ACATATAAAG 


ATGAAAAATC 


TCCAGATGGT 


AAGCAAATTG 


TTCCAGAAAT 


TCACCCAGAA 


2520 


AAAGTCAAAG 


GAGCAAATAT 


CACATTTGAG 


CATGATACTT 


TCACTATAGG 


CGCAAATTCT 


2580 


AGCTTTGATT 


TGAATGCGGT 


T AT AAATGT T 


GGAGAGGCCA 


AAAACAAAAA 


TAAATTTGTA 


2640 


GAATCATTTA 


TTCATTTTGA 


GTCAGTGGAA 


GCGATGGAAG 


CTCTAAACTC 


CAGCGGGAAG 


2700 


AAAATAAACT 


TCCAACCTTC 


TTTGTCGATG 


CCTCTAATGG 


GATTTGCTGG 


GAATTGGAAC 


2760 


CACGAACCAA 


TCCTTGATAA 


ATGGGCTTGG 


GAAGAAGGGT 


CAAGATCAAA 


AACACTGGGA 


2820 


GGTTATGATG 


ATGATGGTAA 


ACCGAAAATT 


CCAGGAACCT 


TAAATAAGGG 


AATTGGTGGA 


2880 


GAACATGGTA 


TAGATAAATT 


TAATCCAGCA 


GGAGTTATAC 


AAAATAGAAA 


AGATAAAAAT 


2940 


ACAACATCCC 


TGGATCAAAA 


TCCAGAATTA 


TTTGCTTTCA 


ATAACGAAGG 


GATCAACGCT 


3000 


CCATCATCAA 


GTGGTTCTAA 


GATTGCTAAC 


ATTTATCCTT 


TAG AT T C AAA 


TGGAAATCCT 


3060 


CAAGATGCTC 


AACTTGAAAG 


AGGATTAACA 


CCTTCTCCAC 


TTGTATTAAG 


AAGTGCAGAA 


3120 


GAAGGATTGA 


TT 










3132 


(2) INFORMATION FOR SEQ ID NO : 111: 









<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 672 base pairs 



WO 98/18931 



PCT/US97/19588 



806 

(B) TYPE: nuc.leic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

C GAG AT TT C T TTAAATGAAC TACGTGAAAT CTACCCATCA TCCAGATCTG GATATTCTCT 60 

CCTATCTATA AGTAAAGTTT TAGGAGATTT TAATATAAGT TCTCATGCTT TTAAAGCTTC 120 

GGTAAGAGAT TTAAAACCGC TCAGTTTCCC ACTCATTTGC TTCTGGGAGA GTTCTCATTT 180 

TATTATTCTT GAAAAAATTA GTAAAAACAA GTTTTATATT TT AG AT C C T G CAAAAGGCAG 24 0 

GCAGAGAATG TCAATAAGTG AATTTGAAAG GCATTATTCA AATATCATTT TAACATTTAA 300 

AAAGT TAG AT AGCTTTATGT CTCGTAAAGA TAATAAGAAG TCGCCTGTTT TAAAGTATTT 3 60 

TTTTAAGTAT AGGAATAAGC TAGGGATTTT ATTTTTTGTA ACAGCATTAT TGTATGTAAT 42 0 

ACAATCATTA GTACCTATAG C T AAT AG AT A CATAATTGAC ACGAATTTCA AGGACGATTC 480 

GTATTCGTCT AGAATGTTAT TT AC TAT AT T AT T T AT ATTT ACTGTTTCAT T C T C ACT AAT 540 

GT ATT TAT T A AGACAGATAT ATGTTGCATC CTTAAAATAT ATAATGGATA AAGAGATTAG 600 

CTATGATTTT ATGAAACATT TGATATATTT ACCTTACAGT TTTTATGAAA AACGTACTTT 6 60 

AGGGGATATA CTTTTTAGAG CTAACTCTAT TGTTTATATA AGAGAAATAC TATCAAATAA 72 0 

TTTTATAGCA GCTATACTTG ATTTGT T AAT GATTGTGGTT TATGCTGTGG TTTTATTTAG 7 80 

CTTTTCTAAG TACATGGTAA TCTTTTTAAT ATCACTAAGT CTAGCTCTAT CTATTGTAAT 84 0 

GTATCCAATC ATAAAAATCT CAAAAAATTT AATT G AT AAA AATATAAAAG AAAAGGTTAA 900 

TGTTCAAAAT ATTACTTCCG AAGTAATTTC TAAAAATAGT GATATTAAGC TAACTGGAGA 9 60 

AGAGGAATTT TGGATTAACA AATGGGATAA T TTT AAT AC A AAACAGCTCA TCATAGGTCG 1020 

AAAACT TGAT ATACATTTAT CAATTGTTAG TAGTATAACG AATGTTTTAC AAATTATTCT 1080 

CCCTGTTTTG ACCCTTATTG TAGGTGTAAA TATAAAAACA TTCGAACAAT TGACGTTAGG 114 0 

ACAAATTGTA GCAATAAGTA CAGTCTCACC ATACTTTATT TCTCCTATAA TTTCTTTAAG 1200 

TGATAACTAT ATACAATTAA TGTTATTAAA GGGATATTTT TTAAGAATAG AGGATGTGTT 12 60 

T AAT AC T AAA TCCGAATTAA TTCCAGAAAG AGTCAGTCAA GATATAAAAT TTGATAAAAA 132 0 

AATAGAATTA AAAGATATTT GGTATAAATA TGGATTATTT GATGATTATG TTTTGAAAGG 1380 

AATAAATGTT ACTATTAAAA AAGGAGAAAC TGTTGCTATT GTTGGAGAAT CAGGTTCAGG 144 0 

TAAGAGTACA TTAGCTAAAA T TTT ATT AGG TTTATTAGAA CCTAATATTG GTT C AAT AG A 1500 

AGT TGATGGA GTAGAAAAAG AAGAAATTGG TCAAACATTG TATAGAAAGA TTTTTGGAGC 15 60 
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AGTGTTACAA AATTCAACCC TAAGTTATGG TACCTTAAGA GAGAATTTGA CATTTGGACA 162 0 

CTTTGTTTCA GATGAAGAAT TAATGACAAA TCTAAATTCA ATTGGTCTTA GCAATGTAGT 1680 

TAAATCTTTA CCTCTTGGAT TAGAGACAAT CATCGCTGAA GAAGGTAATA ACTTTTCTGG 1740 

AGGGCAGCAG CAAATGATAC TTTTAGCTCG TTGTCTTTTG TCGAAACCTT CGGTAGTTGT 1800 

TTTGGACGAA GCAACAAGTA GTTTAGATAA TTTATCTCAA CAAATTACAA CTTCTTACTT 1860 

AAGTGAAATC GGTACCACTA AGATTTTAAT TGCCCATCGA CT AG AT ACT A TCAAGTCTGC 192 0 

AG AT AAG AT C TTAGTAATGC ATAATGGTGA AATTGTAGAG ATTGGGACCC ATAGAGAACT 1980 

TCTTGAACTA GGAGGCATTT ATAAGCAATT GTATTCAAAT AATTAGTTTT TGATTAAAAG 2 040 

GGTAAATTTA TGAAGATTAT GAAAAAAAAA TATTGGACTT TAGCGATATT ATTCTTTTGT 2100 

TTGTTCAATA ATTCTGTTAC TGCTCAAGAA AT AC CT AAAA ATCTTGATGG CAATATAACT 2160 

CACACTCAGA CTAGCGAAAG TTTTTCTGAA TCTGATGAAA AACAGGTTGA CTATTCTAAT 222 0 

AAAAATCAAG AAGAAGTAGA CCAAAATAAA TTTCGTATTC AAATCGATAA GACAGAATTA 22 8 0 

TTTGTAACAA CAGATAAACA TTTAGAAAAA AACTGTTGTA AATTGGAACT TGAACCACAA 2 340 

ATAAATAACG ATATTGTTAA CTCTGAAAGT AATAATTTAC T AGG C G AAG A TAATTTAGAT 2400 

AATAAAATTA AGGAAAATGT TTCTCATCTA GATAATAGAG GAGGAAATAT AG AG C ATG AC 24 60 

AAAGATAACT TAGAATCGTC GATTGTAAGA AAATATGAAT GG GAT AT AG A TAAAGTTACT 2 52 0 

GGTGGAGGCG AAAGTTATAA AT TAT AT T CT AAAAGTAATT CTAAAGTTTC AATTGCTATT 2 5 80 

TTAGATTCAG GAGTCGATTT ACAAAATACT GGATTACTGA AAAATCTTTC AAATCACTCA 2 640 

AAAAAC T ATG TCCCCAATAA AGGATATTTA GGAAAAGAGG AGGGAGAGGA AGGAATAATA 27 00 

TCAGATATTC AAGATAGATT AGGTCATGGT ACGGCTGTTG TAG C T C AAAT TGTAGGGGAT 27 60 

GACAATATTA ATGGAGTAAA TCCTCACGTT AATATTAACG TCTATAGAAT ATTTGGTAAG 2 82 0 

TCGTCAGCTA GTCCAGATTG GAT T GT AAAA GCAATTTTTG ATGCTGTAGA TGATGGCAAT 2 880 

GAT AT TAT C A ATCTTAGTAC TGGACAATAT TTAATGATTG ATGGAGAATA TGAGGACGGA 294 0 

ACAAATGATT TTGAAACATT TTTGAAGTAT AAAAAGGCTA TTGATTACGC GAATCAAAAA 3 000 

GGAGTAATTA TAGTAGCTGC ATTAGGGAAT GACTCCCTAA ATGTATCAAA T CAGT C AG AT 3 060 

TTATTGAAAC TTATTAGTTC ACGCAAAAAA GTAAGAAAAC CAGGATTAGT AGTTGATGTT 3120 

CCAAGTTATT TCTCATCTAC AATTTCGGTC GGAGGCATAG ATCGCTTAGG TAATTTATCA 3180 

GATTTTAGCA ATAAAGGGGA TTCTGATGCA AT AT ATG CGC CTGCAGGCTC AAC ATT AT CT 324 0 

CTTTCAGAAT TAGGACTTAA TAACTTTATT AATGCAGAAA AATATAAAGA AGATTGGATT 33 00 
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TTTTCGGCAA CACTAGGAGG ATATACGTAT CTTTATGGAA ACTCATTTGC TGCTCCTAAA 3 3 60 

GTTTCTGGTG CGATTGCAAT GATTATTGAT AAATACAAAT T AAAAG AT C A GCCCTATAAT 3 42 0 

TATATGTTTG TAAAAAAATT CTGGAAGAAA CATTACCAGT AAAAAATGGT ATAAAAGTGT 3480 

T AAAT AT AC C AAACGTATTG AGATATGATT TGAATATGTT ACAATTAGAA T AT AAAAAT G 3 540 

AACAAAGTTG GGATAGTTTC ATAGATAATG TTAATTTAAT TGAGTTGGAA G AG AG AATT C 3 600 

AAACT AC TAT TGGAATTAAA CAAATAAACA CACACAATAT TATTACTATT GCCCGAGAAG 3 6 60 

GGT ACT C T C A AAAT TAT T T A CCTAACACTT CAGAAAATAC AT AT AATT C A TTACAAGTCA 372 0 

GTTTAGTTGG AGTATTACTA CTTTTTATAA GTATGGTAAA TATTTTATGG GCTAAAAAAA 37 80 

GTAAATGAAA ATAAAATTTG GAGCCCTCTG AAAAAGTAAG TCCTACAGTT CAACTAAAAT 3 84 0 

GAGTCAAAAG ATGAATCACC TTGATGTAGG GGAGTTTGTC TTATTGCTGC CTGAACACCT 3900 

CCGTTCAGAG GAAGAACATT ATAAATCTGT TTTTGAAGAC GACTTAACCA GT C GC AT AT C 3960 

TAGTCAAGAT GAACGACAGC AAATGACTGC TACGGTAGGT TATTTAGAAT CAGGTCAGGA 4 02 0 

TCGTTTTGTG TATAATACGA CCCCTATTTC TTACCAGCAG TTTTTGAAAG ATCCAATCAT 408 0 

C ATTGT TATA ACACCCCAAT CAACTGGTCC ACAGTCCATT TTGTTTTGGA TAGACGCAGT 4140 

AC AG AAC T AC GTTCTCTTTA ATCAATTGTC TGATGCCCAG GAGCTTATCC AGAGACAAGG 42 00 

CATTGAAAAT TGGGTCTCAG AAATGCAAAC AGGTT AC C AC AACTACATCA CAT T ATTGG A 42 60 

TAATATCCAG AGGGAACGTT GGGTAATGCT AGCAGGAGCT GTGCTTGGGA TTGCAACTTC 43 2 0 

AATCTTGTTG TT T AAC ACT A TGAATAGGCT CTACTTTGAA G AATT TAG AC GTGCCATTTT 43 80 

TATCAAACGC ATTGCAGGTC TCAGGTTCTT AGAAATCCAT CGCACTTATC TCTTTGCTCA 444 0 

ACTGGGTGTG TTTTTACTGG GATTTGTTGC GAGTGTATTT CTTCAGGTAG AGATAGGAGT 4 500 

TGCTTTCTTA GTCTTGTTAC TCTTTACTGG TCTATCTCTT TTACAGTTAC ATGTCCAAAT 4560 

GCAGAAAGAA AACAAGATGT CCATGCTTGT TTTGAAGGGA GGTTAATATG ATTGAACTTA 4 620 

AACAGGTGAG TAAATCTTTT GGAGAACGAG AGTTATTTTC GAATCTTTCA ATGACATTTG 4 6 80 

AGGCTGGAAA AGTCTATGCC TTAATTGGTT CAAGTGGTAG CGGAAAAACA ACCTTGATGA 474 0 

AC ATG ATTGG GAAATTAGAA CCTTATGATG GGACGATTTT TTACCGAGGT AAAGACTTGG 4 800 

CCAATTATAA ATCAAGTGAT TTTTTCCGTC ACGAATTGGG CTACCTCTTC CAGAACTTTG 4860 

GCTTAATTGA AAACCAAAGT ATT G AAG AAA ACCTTAAGCT AGGTCTCATT GGTCAAAAGT 492 0 

TGAGTCGGTC GGAACAGCGG TTGAGGCAGA AGCAGGCTTT AGAACAGGTC GGCCTGGTTT 4980 

ATCTTGACCT AGATAAGCGC ATCTTTGAGT TATCGGGCGG AGAATCGCAA CGGGTTGCCT 504 0 

TGGCAAAAAT TATCTTAAAG AATCCACCCT TTATTCTGGC AG ATG AG C C A ACAGCTTCAA 5100 
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TAGACCCAGC AACCTCTCAG TTGATTATGG AGATTTTGCT ATCTCTTCGA GATGATAATA 5160 

GGCTAATCAT TATCGCAACA CATAATCCGG CAATTTGGGA GATGGCTGAT GAAGTGTTCA 522 0 

C GAT GG AT C A TCTGAAATAA AAATCCTTGT TTTTAATTGC ACGATGAGTT ACTGAAATAT 52 80 

TATCATGAAT CAAGAATTGG AGTTAATTTA GAATTGTACT TAATTTAGAA TTGTACTTTA 534 0 

TTAATATTGA GGTAACTTTT TCTTGATAAA GGAAGAAATA ATGGAGAGGA AGTTAGAATG 54 00 

AAAAAATTCG ACAATTATAT TATTGAGAAG CCTTGCGATT CTAATTCAGA TAAACTGCAA 54 60 

AAAATCTTAA TAATTGAAAG TTTGGTAGAT GATATTTTGC AATTTTCTCT CAGAATCAAT 552 0 

AATAGTGTAG GAGAGATTTT CCTCCTACAA CCGTTTTAAA AGAAAACTAT CTTTATTCCA 5580 

TGTTATTTTG AGGAAGATAT TGTGAAAGTC AAAGATGATG AT AAAGT TG A GTGGAATTTG 5 64 0 

TTAGAATTTC AAAAATTTAG AGCATTTTTG GCTTAGTAAT CTGTGTTGAA GGCTCAAAAC 5700 

CTATGGTAAA AAAGTAGCTT TGAAAACGTA TTGCCTCCAA AGATTTAGTT AAATAATGAT 57 60 

TTAACACAAA AAGAAATTAT TGAAGTTCTG GAAAGATGTT GTTTCAGTAT TGAGAAAAGG 5 82 0 

TGGGAAAAAC TTGCGATTTT CACAGAGAAA GGAAGAAAAA GTATAGAAAT ATAGTCAATT 58 8 0 

GAAACAAGAA CAGGATAAAA GAACCTTTTG TGCCATATTT TTCTCCTTTC GCTTTACAAT 5 940 

TGGATTGAAC ACCTTTATTG TATCGCGTTT GGAGTTTTTT TGGTATAACC TTCGACGCAC 6000 

ACCCGCATAG CGGGTGTTTT TTTTGTCTCG CACCTAACGG AGCGAGACAA ACTAATAGTC 60 60 

ACTTAATCAA AAAACGCACC AT AT C AAAAA CTAAAAAGTT TGATATCATG CGTCATGTCT 612 0 

TAAACTAATT G ACT AT AC T T TCTATTCAAA TGAGCTTTTA ACCAATTGAT TGAGCCAATC 6180 

CACTCTTAAA AC C AAAGAG C AATTTCTCGC TTAGCTGACT CTTCTGAATC TGAACCATGT 62 4 0 

ACAACATTTT GGATAATCTC ATTTTCTCCA GCAGCTTTTG C AAAATC AC C TCGAATAGTG 6300 

CCTGGTAAAG CTTCTTCTGG ACGAGTTGCA CCCATCATGG TCCGCCAAGT T T CG AT TACT 6360 

TTGGGACCAG AAATGACACC CACAAGAACT GGACCTGAAG TCATGAATTC ACGAATCGGT 642 0 

GGGTAAAAAC TCTGACCAAC CAAGTCCTGA TAGTGCTGGT CAATCAACTC TTCTGAAACC 6480 

TGTGAACGAA ACTCCAATTT TTCGATTGTA AATCCACGTT GTTCGATGCG CTTTAACACT 6540 

TCACCCACTA GCCCTCTTTT TACACCATCT GGTTTGATGA TAAAGAATGT TTGTTCCATA 6600 

CCCGTCTCCT TTGTCAGCTT CTTTCTTTTA TTTTACCACA TTTCGTGGAA AAATGGAGAA 6660 

AGTTTTCAGA AGAGAGAATG AGAGAACCCT CGGGTTCTCT CATTCTCTCT TATTCTACTG 672 0 

TTTCTTCCAC AGTTTCAACG GCAGTATCCA CAACTACTTC TGTTGTTTCT TCATTTCCTT 6780 

CTTCCTCTAC TGGAGGATTA AGGTATTCTT CTTCGTTGAC AGCATGTGGT TCAAGGTTAC 6840 
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GGTAACGGGC CATACCAGTA CCAGCTGGGA TGATCTTACC GATGATAACA TTTTCTTTAA 6900 

GTCCAAGGAG ATGGTCTTTC TTACCACGGA TAGCTGCGTC AGTAAGGACA CGAGTTGTTT 6960 

CCTGGAAGGA AGCCGCTGAC AAGAAACTGT TTGTTTCAAG TGAGGCTTTG GTAATTCCCA 7020 

TAAGGACTGG GCGACCTGTC GCTGGAACTC CACCTGCGAT AAGGACATCT TTGTTGGCAT 7 080 

CTGTAAAGTC ATTGATATCC ATGAGGGTAC CCATGAGAAG ATCTGTATCA CCTGGATCCA 7140 

T G AC ACGG AC TTTACGGATC ATTTGACGAA CCATTACCTC GATGTGTTTG TCACCGATTT 72 00 

CTACCCCTTG GCTACGGTAA ACTTTTTGTA CTTCACCGAG AAGGTACGTT TCAACTGACA 72 60 

AG AC AT C ACG AACTGCAAGG AGACGTTTTG GTT GG AT AG A ACCTTCTGTC AGAGCAGCAC 7320 

CACGCGCTAC TTGGCCCCCA ACTTCGACAC GCATACGAGC TGTAAATGGA ACG AC AT AT T 73 8 0 

CACCTTCGCC AGTTTCACCC TTAACAAAGA CTTTCTTGGT ACGAGTTGAT GCATCTTCTT 7440 

CGATAGCAGT AACTTGTCCT TTAACCTCTG TAATAACCGC TTCCCCTTTA GGATTGCGGG 7 500 

CTTCAAAGAT TTCTTGGACA CGAGGAAGAC CCTGAGTGAT ATCGGTATTT GAGGCAACCC 7 5 60 

CACCTGTGTG GAAGGTACGC ATTGTAAGCT GTGTACCAGG TTCCCCGATA GATTGGGCAG 7 62 0 

CGATTGTACC AACTGCTTCA CCAACTTCAA CCGCATCACC AGTCGCCAAG TTGATACCGT 7 680 

AACAGTGACG GCAGACACCG TGACGAGTGT TACATGTAAA TACAGAACGG ATAGTCACTT 7740 

CTTCCACACC AGCATTGACA ATTTCACGCG CCTTGTCTTC TGTAATCAAT TCATTTGGAC 7 800 

CAATAATCAC TGCACCAGTT TCTGGATGTT TAACAGTTTT CTTAGTGTAA CGACCGTTGA 7 860 

GACGCTCTTC GAG AG AC T C G ATCATCTCTT TTCCTTCTGC GATAGAACGG ATCAAGAGAC 792 0 

CACGGTCAGT TCCACAGTCG TCCTCACGGA TGATAACGTC TTGGGCAACG TCGACCAAAC 7980 

GACGAGTCAA GTAACCTGAG TCGGCTGTCT TAAGGGCCGT ATCGGTCATA CCTTTACGAG 804 0 

CACCGTGAGT TGAGAAGAAC ATTTCCAATA CCGACAAACC TTCGCGGAAG TTTGAAAGGA 8100 

TTGGCAATTC CATGATACGT CCATTCGGAG CAGCCATCAG ACCACGCATA CCGGCAAGCT 8160 

GTGAGAAGTT TGAGATGTTA CCACGGGCTC CAGAGTCCAT CATCATAACG ATTGGGTTCT 8220 

TAGGATCTTG GTTAGCAATC AAGCGTTTCT CAAGTTTTTC ACGGGCAGCA CGCCATTCAG 8280 

CTGTAACAGC ATTGTAACGC TCGTCGTCTG TG AT CAT AC C ACGACGGAAT TGTTTGGTGA 8340 

TTTGTTCGAC ACGTTTGTGT GATTCTTCAA TGATTTCAGC CTTGTCATCA ACGACTGGGA 8400 

TATCGGCAAT ACCCACTGTC AATCCTGCAA GAGTTGAGTG GTGGTAACCG AGGTTCTTCA 8460 

TGCGGTCAAG TAGGGCAGAA GTTTCTGTCG TACGGAAACG TTTGAAGATT TCAGCGATGA 852 0 

TATTTCCAAG GTTTTTCTTC TTGAATGGAG GGTTGAGCTC AAGATTGCTG ATAGCTTCCT 8580 

TGATATCTCC ACCAAGTGGC AAGAAGTATT TAGCTGGAAC ACCTTCTGTC AAGTTGGCAT 8640 
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TGTTTGGTTC 


TTGCAAGTAT 


GGTAGCCCCT 


CTGGCATGAT 


ATCGTTGAAG 


AG AAT TT TAG 


8700 


C AACTGT TGT 


AAGCAAGACC 


TTATGTCTTT 


GCTCTTCTGT 


CCAAGGCTTG 


TTGAGGCTGT 


8760 


CTGTTGCGAT 


ACCAACACGT 


GAGTGGAGGT 


GAACATAACC 


ATTGCGGTAA 


GCCATAACCG 


8820 


CTTCGTCACG 


GTCTTTGAAG 


ACCATTCCTT 


CACCTTCGCG 


ACCAGCTTCT 


TCCATGGTCA 


8880 


AGTAGTAGTT 


ACCCAAAACC 


ATGTCCTGAG 


ATGGAGTAAC 


TACCGGTTTC 


CCATCTTTCG 


8940 


GGTTCAAGAT 


GTGCTCAGCA 


GCTAGCATGA 


GGATACGAGC 


TTCTGCTTGT 


GCTTCTTCTG 


9000 


AAAGTGGTAC 


GTGGATGGCC 


ATTTGGTCCC 


CGTCAAAGTC 


AGCATTGTAG 


GCTTCACAGA 


9060 


CAAGTGGGTG 


CAAGCGAAGA 


GCCTTACCAT 


CAATCAAGAC 


TGGCTCGAAG 


GCTTGGATAC 


9120 


CCAAACGGTG 


AAGGGTCGGT 


GCGCGGTTCA 


AAAGCACTGG 


GTGTTCTTTA 


ATCACTTCTT 


9180 


CAAGGATATC 


CCAGATACGC 


TCATCTCCGC 


GTTCCACCAA 


GCGTTTAGCT 


GCTTTGACGT 


9240 


TTTGCACGAT 


ATCACGGGCA 


ACGATTTCAC 


GCATGACAAA 


TGGTTTAAAG 


AGTTCAATCG 


9300 


CCATTTCACG 


CGGCACACCA 


C ATT GGT AC A 


TCTTAAGAGT 


TGGACCAACG 


GCGATAACTG 


9360 


AACGTCCTGA 


GAAGTCAACA 


CGTTTACCGA 


GCAAGTTTTG 


ACGGAAGCGT 


CCTTGTTTAC 


9420 


CTTTAAGCAT 


GTGGCTCAAT 


GATTTCAATG 


GACGGCTACC 


TGGTCCTGTG 


ATTGGACGAC 


9480 


CACGACGACC 


ATTGTCAATC 


AAAGCGTCAA 


CTGCTTCTTG 


AAGCATACGC 


TTCTCATTTT 


9540 


G AACG AT GAT 


ACCTGGTGCA 


TTTAACTCAA 


GCAAACGAGC 


CAAACGGTTG 


TTACGGTTGA 


9600 


TAACACGGCG 


GTAAAGGTCA 


TTCAAGTCAG 


ATGAGGCAAA 


ACGGCCACCA 


TCCAACTGCA 


9660 


ACATTGGACG 


AAGATCTGGT 


GGGATAACCG 


GAAGGATGTT 


AAG AAT CAT C 


CATTCAGGTT 


9720 


TGTTTCCAGA 


CTTGTAAAAG 


GCATCCAAAA 


CATCCAAACG 


ACGGATGGCT 


TTGACACGCT 


9780 


1 rift i >— « m -m — j — » 

TTTGTCCAGT 


AGCTGTTTTC 


AATTCTTCTT 


TGAGTTCAGC 


AATTTCTTTT 


TCAAGATCTA 


9840 


CTTGCTTCAA 


AAGGTCTTGG 


ATGGCTTCCG 


CACCCATCTT 


GGCAACAAAT 


GAACCATAAC 


9900 


CATATTCACG 


CAAGCGCTCT 


CGGTATTCGC 


GCTCTGTCAT 


GATAGACTTG 


TGCTCAAGTG 


9960 


GTGTATCCTT 


AGGATCAATC 


AC C AC AT AAG 


CCGCAAAGTA 


GATAACTTCC 


TCGAGGGCAC 


10020 


GAGGGCTCAT 


ATCAAGGGTC 


AAGCCCATAC 


GGCTTGGAAT 


CCCCTTGAAG 


TACCAGATGT 


10080 


GAGATACAGG 


AGCTTTCAAT 


TCGATATGTC 


CCATACGCTC 


ACGACGAACT 


TTCGTACGCG 


10140 


TTACTTCAAC 


CCCACAGCGG 


TCACAAACAA 


TTCCTCTGTA 


ACGAATGCGT 


TTGTACTTAC 


10200 


CACAAGCACA 


TTCCCAGTCT 


TTTGTAGGAC 


CAAAGATCAC 


TTCATCAAAG 


AGTCCTTCAC 


10260 


GTTCTGGTTT 


CAAGGTACGA 


TAATTGATTG 


TTTCAGGTTT 


TTTGACTTCT 


C CAT AAG AC C 


10320 


ATGAACGGAC 


TTTACTTGGA 


GAAGCTAGGG 


TGATTTGCAT 


ACTTTTAAAA 


CGATTTACAT 


10380 
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CAACCACTAT TTCTTCCCTT TCTATTCTAA GTGAACTGCT TATTCTTGTT CAGCAGCTTC 1044 0 

TTCTGTTGCT TCCGCTTTTG TTGCTTTCTC AGCTTCTTCA GCTTCAAAGG CTGCTTTAGC 10500 

CTCTTGGGCT GCTTTTTCGC GGGCTTTTTC AAGGTCATCT ACGTGGATGA CATCTTCGTC 10560 

CATTCCTTCA TCCAAGTCGC GAAGTTCCAC TTCTTGGTCA TCTTCGTCTA GGACACGCAT 1062 0 

GTCAAGACCA AGAGATTGCA ATTCTTTGAC AAGAACTCGG AAGGATTCTG GAACACCTGG 10680 

TTTTGGAATT GGTTTGCCTT TTGTAATAGC TTCATAGGCT TTCAAACGTC CGTTG AT AT C 10740 

GTCCGACTTG TAAGTCAAGA TTTCTTGAAG GACATTTGAC GCACCGTAGG CTTCAAGAGC 108 00 

CCAAACCTCC ATCTCACCGA AACGTTGTCC ACCAAACTGA GCCTTACCTC CGAGTGGTTG 10860 

TTGGGTAACA GTTGAGTATG GTCCGACTGA ACGCGCGTGC AATTTATCAT CAACCATGTG 1092 0 

GTGGAGTTTG AT C ATGT AC A TGACTCCGAC AGAAACACGG TTATCAAACG GTTCACCAGT 10980 

ACGTCCATCG TAAAGGATCG TTTTGGCATC GCTATCCATA CCTGCTTCTT TAACAGTTGA 11040 

CCAAAGATCT TCAGAACTTG CTCCATCAAA GACTGGTGTA GCGATGTGAA TACCAAGAGT 11100 

ACGAGCTGCC ATACCAAGGT GAAGCTCCAT AACCTGACCG AT AT TC AT AC GTGATGGTAC 1116 0 

CCCAAGTGGG TTCAACATGA TGTCGACTGG AGTTCCGTCT GGAAGGTAAG GCATGTCTTC 1122 0 

TACAGGAACG ATACGAGAGA CAACCCCTTT GTTTCCGTGA CGTCCGGCCA TTTTATCTCC 11280 

GACCTTAATC TTACGTTTTT GAGCGATGTA AACACGAACC AACATGTTAA CACCTGATTG 113 4 0 

CAACTCATCT CCATTTACAC GTGTAAAGAT CTTAACATCA CGAACGACAC CATCGGCACC 11400 

GTGTGGTACA CGAAGAGAAG TATCACGCAC TTCACGAGAC TTGTCTCCAA AGATAGCGTG 114 60 

CAAGAGACGT TCTTCAGCTG AAAGATCTTT CTCACCCTTA GGTGTTACTT TACCTACAAG 1152 0 

AATATCACCT TCTTTAACCT CAGCACCAAT ACGGATAATC CCCATTTCGT CAAGGTCTTT 1158 0 

GAGGGCATCT TCACCAACGT TTGGAATTTC GCGAGTGATT TCTTCAGGCC CAAGCTTTGT 11640 

ATCGCGCGTT TCTGATTCGT ATTCTTCAAG GTGAACAGAT GTGTAGACAT CGTCCTTCAC 117 00 

CAAGCGTTCG CTCATGATAA CGGCATCCTC GAAGTTGTAA CCTTCCCAAG TCATGTAGGC 117 60 

AACGATTGGG TTTTGTCCAA GCGCCATTTC TCCATTTTCC AT AG AAGGT C CGTCAGCGAT 1182 0 

GAAATCGCCT TTTTCAACGA CATCACCAAC TTTTACGAGA GTGCGTTGGT TGTAAGCAGT 11880 

ACCTGAGTTT GAACGACGGA ATTTTTGGAT GTGGTAAACA TCCAATGAAC CATCTTCACG 11940 

ACGAACTTCT ACCTTGTCAG CATCTGCGTA AGTAACTTTA CCATCATACT GAGCAATCAC 12 000 

AGCCGCACCA GAATCGTGGG CTGCTTGGTA TTCCATACCA GTACCAACGT AAGGTGCCTG 12060 

AGGATTAATC AATGGCACAG CCTGACGTTG CATATTGGCT CCCATGAGGG CACGGTTGGA 1212 0 

GTCATCGTTT TCCAAGAAAG GAATACATGC TGTCGCAACG GCAACTACCT GTTTTGGTGA 12180 
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AACGTCCATG TAGTCAACAA TATTAGCTGG ATACTCTTGG TTGACCCCTT GGTGACGTCC 12240 

CATGACAATC TTCTCAGCAA AGGTTCCATC TTCATTCAGA CGAGAGTTAG CCTGAGCTAC 12 300 

AGTATATTCA TCTTCTTCAT CAGCTGTCAA CCAAACAATT TCGTTCGTGA CAACACCTGT 123 60 

TTCACGGTCA ACCTTACGGT ATGGTGTTTG AACAAAACCA TATTTGTTCA AGTGTCCATA 1242 0 

AGATGACAAG TTATTGATCA AACCGATGTT AGGTCCTTCA GGTGTCTCGA TTGGACACAT 124 80 

ACG AC CAT AG TGAGTGTAGT GCACGTCACG TACTTCATAT CCAGCACGGT CACGAGTCAA 12 540 

ACCACCAGGT CCTAAGGCTG AC AAACGGC G TTTGTGAGAC AACTCAGAAA GCGGGTTGTG 12 600 

TTGGTCCATG AACTGTGACA ACTGTGATGA ACCAAAGAAT TCTTTAACTG C AGCTGT T AC 12660 

AGGACGGATA TTGATAATTT GTTGTGGTGT CAAGACTTCA TTGTCCTGAA C AG AC AT ACG 1272 0 

TTCACGGACA TTACGTTCCA TACGAGAAAG TCCCAAACGT ACTTGGTTGG CAAGCAATTC 12780 

ACCAACCGCA CGGATACGAC GATTTCCAAG GTGGTCGATA T C AT CT AC AC GGCCAAGTCC 12 84 0 

TTCAGCCAAG TTGAGGAAGT AGCTCATCTC AGCAAGGATA TCTGCAGGAG TCACCGTACG 12 900 

AACCTTGTCA TCTGGGTTAG CATTACCAAT GATCGTTACG ACGCGATCTG GATCAGTTGG 12 9 60 

AGCAATAACC TTGAATTTTT GAAGAACAAC AGGCTCAGTC ACAACGGCTG CATCGTTTGG 13 020 

GAT GT AG AC A ATCTTGTTCA AGTCGCCATC CAAATGGCTT TCAATGCTTT CAATCACGCT 13 080 

ACGAGTCATA AT CGT AC C AG CTTCTACCAA GATTTCTCCA GTTTCAGGGT CTACCAATGG 1314 0 

CTCTGCAATG GTTTGGTTGA GCAAACGTGT TTTAACATTG AGTTTTTTAT TGATTTTGTA 13200 

ACGACCAACT GCTGCCAAGT CATAACGACG TGGGTCAAAG AAGCGAGCTA CAAGCAAGCT 13260 

ACGTGAGCTT TCAGCCGTCT TAGGCTCACC TGGACGAAGG CGTTCGTAAA TTTCTTTCAA 13 320 

GGCTTCGTCT GTACGAGAGT CCATTGGATT CTTGTGGATA TCTTTTTCAA CAGTGTTGCG 13 3 80 

AACCAATTCG CTGTCACCAA AGAT AT C AAA GAT T T CAT C A TCACCTGAGA AACCAAGAGC 13440 

ACGAACCAAG GTTGTAAATG GAATCTTACG AGTACGGTCG ATACGAGTGT AGGTG AT AT C 13 5 00 

TTTTGAGTCG CTTTCAAGTT CCAACCAAGC TCCACGGTTA GGGATAACAG TTGAACCATA 13 5 60 

GCCCACCTTA CCATTTTTGT CTACTTTGTC GTTAAAGTAA ACACCTGGTG AGCGGACCAA 13 62 0 

CTGAGAAACG ATAATACGTT CACCACCATT GATGATGAAA GTACCCATTT CTGTCATGAT 13 6 80 

TGGGAAATCA CCAAAGAAAA CTTCTTGGGT CTTGATTTCG CTTGTTTCTT TATTGATCAA 13740 

ACGGAAGGTT ACAAAAATTG GTGCTGAGTA GCTAGCATCG TGGATACGAG CTTCTTCTAG 138 00 

CGTATATTTT GGTTCCTTGA TTTCATATCC AACAAATTCC AACTCCATTG TGTCTGTGAA 138 60 

GTTTGAAATT GGCAATACAT CTTCAAACAC TTCCTTAAGA CCGTGGTCTA GGAAAGCTTT 13 920 



WO 98/18931 



PCT/US97/19588 



814 

GAATGAGTCA GTTTGAATTT CAATCAAATT TGGTAAGTCA AGAACTTCTT TGATTCTTGA 13 980 

AAAACTACGA CGGGTACGAT GTTTCCCGTA TTGAACGTCA TGTCCTGCCA AGATGATTCT 14040 

CCTTTGTAAA TAAGTTCCAA GCCTTGTCAA TCAGGCTTTT CTAATCGTCA TATGGTTGTA 14100 

AACCCCTTAT CACCGTGTCC TCTTGACGAA TTTTCAGAAT CTTTAAGCCT C TGT T AC AAA 14160 

TGCTCAAAAT CTTGAAAAAA AGCACAAAAA GAGCAGCTAA ATCTGACTTT TTCAGAAGAT 14220 

TTAACTGCTG TGAGCCTTGT CTGGACAATA T T T C AG AC AA AAC C T AC G AC AAATGATTAC 142 8 0 

C CAT AT TATA CCCTATTTAG CTAGATTTTT CAAGGGGTTT CAGTAGGTTT TTGGTAAATT 14340 

TTTTCCCATA GAAAACTTGG CATC AC ATT C GAATCACGCT ATGGTACAAA AAACTGAAAA 144 00 

AACTATTGAC TG AAAAT CAT TTTCAAGGTA TAATAATAAA CGTTAAGGCG GTATAGCCAA 144 60 

GTGGTAAGGC ACGGCTCTGC AAAAGCTTGA TCGTCGGTTC AAATCCGTCT ACCGCCTTCT 14 52 0 

ATAACTTGAT TTATCAGGTT TCAAATGAAC AGAAAGCCCA AT TTG AAGGG CTTTTTTTAT 145 80 

TTTCCCTCGA ATAAATACGT ATAACTTTAA AAACTTTTGG AGCGAGTTTG TGGCAGAGTT 14 64 0 

CTTTCCATGG CATAATTCCC TTTTGAAATC AG 14672 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

AGGAGACTAT TCAAGCCCAA ATTgAGTAGC CCAGCAAAGA CTGTATAGAC TGTGATACGT 60 

TTTTCATAGC CATTGGTAAA GAGAATTTGG GAACCAAGAA TGGTATCTAA GGCCAGGATA 12 0 

ATCGTACGAA AAGCGAAGAG AGAGGTCAAG ATGCCGCCTC CGATATATTT TTCACTACCG 180 

TAAAGTAGGA TGGCATTTGG TCCTAAAACC ATGAGTCCAA AACTCAGTGG AATGATAAAG 24 0 

AAGTTAAAGA TTCGACTACC TCTATTAACC AGAGAAACAT AGGCTTCTTT GTCTCCTTTC 3 00 

CCCAGATAGT AACTGAGACG AGGCACACTC ACTCCAATTG CACCTGTTAC AACCCCAGCT 3 60 

AT AAC GGTC A CAATTCGCTG AGCTATGGTA TAGTAACTAA CGTTGACATC AATCCCTGTT 42 0 

TTAACGAGGA AGAGGCGATC TAAAAAAGTG AAGAGCATAT TGGCATTGGC AAAGAC T AAC 480 

ATGGCTGTCA GAGGGAGAAA GAGTGGTTTA AAATC AC T T A GGTGAATTTT AACAAGTTTG 540 

ATGTCTCTTT TAATCCAAAA ATAACTAATC AGGTAGTTAA TCAGCGTCGA TAAACTCATC 600 

ACAAGTGTAT AGACAACAAT ATCGTGTTCA TTTTTAACAA ATAAGAAAAT AGAG AC C AG C 660 
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AT C AGG AT AC GGATGAAGGC AGTTTTGTAA AAGAGAAAAC TGTAATTTTC CAGAGCTTCA 720 

TTGACCCATT CGATTGAAAA AATCTGGGCA ATGAGTTGAA TCCCCATAAC AAGGTAGACC 780 

TTTTTGACGA TTGGATTATC AGTAAAGAAG AGAGGATAGG CT AGG AT AT A GACAGCAGTG 84 0 

GTCAAAATCG TACAAGCGAT GCACAAATAA AAAAGACTAG AAAAGGTTCT GTTAAGATCT 9 00 

TTTTTGTTAT CCTTGACATT ACTGATAGCC CTTAAACCGT AG T TAT AG AC AC C AT AAGTT 9 60 

GCAAAGGGCA AGAAAAATGA CAAAATAGTG TCGACTGAGT TGAAGTAACC ATAGTCAGTT 102 0 

CGGTCCAAGA CACGCGCGAC ATAGGTTCCA GTTAGGATGG GAAAAATAAT ATTCAAGACA 1080 

CGAATTCCCA TGTAAGATAG AGCATTTAAT TTTATACTTT TCATTCAATT TACCTCGTTT 1140 

T TC AT TAT AT CATAAAGTTA GCTAATAAGA AATGAAGGGC AGTAAGTCAA GTAATCACTT 12 00 

TG AAGTT T CA AATCTTAAGT TTTAAGTTTT CTTTAAGGAA AGTATATTAT TCTGAAGGAC 12 60 

TCTAAAATTT CGCAGCCATT TATTAGTAAT TGCTACAGAA TTCCTAGTCA TT AC TAG AAA 132 0 

TGGACTAGTT TCTTTGAATA ATAGAACTGC ATAATTCTCC TATTCTAGAA GGGGAGGACC 13 80 

AGTATTTCTT TTATGATAGG ACTAGATTGT GGTATAATAG AGAGAATAAG TTTTTTTAGT 144 0 

AAGACAAAGG AGAAAATAGA TGATTTATGC AGGAATTCTT GCCGGTGGAA CTGGCACACG 1500 

CATGGGGATC AGTAACTTGC CAAAACAATT TTTAGAGCTA GGTGATCGAC CTATTTTGAT 15 60 

T C AT AC AATT GAAAAAT T TG TCTTGGAGCC AAGTATTGAA AAAATTGTAG TTGGTGTTCA 1620 

TGG AG ACT GG GTTTCTCATG CAGAAGATCT TGTAGATAAA TATCTTCCTC TTTATAAGGA 16 80 

ACGTATCATC ATTACAAAGG GTGGTGCTGA CCGCAATACA AGTATTAAGA AC AT CAT T G A 1740 

AGCCATTGAT GCTTATCGTC CGCTTACTCC AG AGG AT AT C GTTGTTACCC ACGATTCTGT 1800 

TCGTCCATTT ATTACACTTC GCATGATTCA GGACAATATC CAACTTGCCC AAAATCATGA 18 60 

CGCAGTGGAC ACAGTGGTAG AAGCGGTTGA TACTATCGTT GAAAGTACCA ATGGTCAATT 192 0 

TATTACAGAT ATTCCAAATC GTGCTCACCT TTATCAAGGA CAAACACCTC AAACATTCCG 19 80 

TTGCAAGGAC TTCATGGACC TTTATGGATC TCTTTCTGAT GAAGAGAAGG AAATCTTGAC 2 040 

AGATGCATGT AAAATCTTTG TGATCAAAGG AAAAGATGTG GCTTTGGCCA AAGGTGAATA 2100 

CTCAAATCTG AAGATTACAA CCGTAACAGA TTTGAAGATT GCAAAAAGTA TGATTGAGAA 2160 

AGACTAGTAA AATGATTAAT CAAATTTATC AACTAACTAA GCCTAAGTTT AT C AATGT C A 222 0 

AATATCAGGA AGAGGCTATT GACCAAGAGA ATCATATCCT TATCCGTCCC AACTACATGG 2280 

CTGTCTGTCA TGCGGATCAG CGTTACTATC AGGGAAAACG TGATCCCAAG ATTTTGAATA 2340 

AAAAGCTTCC AATGGCAATG ATTCACGAGT CATGTGGAAC CGTCATTTCT GACCCGACCG 24 00 
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GAACCTACGA GGTTGGTCAA AAAGTTGTCA TGATTCCCAA TCAGTCTCCT ATGCAGAGTG 2 4 60 

ATGAAGAATT CTATGAAAAC TACATGACAG GG AC CC AT T T CTTGTCTAGT GGATTTGATG 252 0 

GCTTTATGAG AGAGTTTGTT TCTCTCCCTA AAGATCGTGT GGTGGCTTAT GATGCTATTG 2580 

AAGATACGGT TGCAGCCATT ACAGAGTTTG TCAGTGTGGG CATGCACGCT ATGAATCGTC 2 64 0 

TATTGACTCT TGCTCATAGC AAGCGGGAGC GGATCGCCGT TATTGGAGAT GGAAGTTTAG 2 700 

CTTTTGTGGT TGCCAATATT ATCAACTATA CTTTGCCAGA AG C AG AG AT T GTGGTTATTG 2 7 60 

GTCGTCATTG GGAAAAGTTG GAACTCTTCT CATTTGCCAA AGAATGCTAT ATTACGGATA 2 82 0 

ATATTCCTGA AGATTTGGCC TTTGACCATG CTTTTGAATG TTGTGGTGGT GATGGTACTG 2880 

GACCAGCTAT TAATGACTTG ATTCGCTACA TTCGTCCTCA GGGAACGATT CTCATGATGG 2940 

GAGTTAGCGA ATATAAAGTC AATCTCAATA CTCGCGATGC C T T AG AAAAG GGCTTGATTT 3 000 

TGGTTGGGTC ATCTCGTTCT GGTCGCATTG ATTTTGAAAA TGCTATCCAA ATGATGGAAG 3 060 

TCAAGAAATT TGCCAATCGT CTTAAAAATA TCCTTTATCT AGAAGAACCT GTAAGAGAAA 312 0 

TTAAAGATAT TCATCGTGTC TTTGCAACCG ATTTAAACAC AGCCTTTAAA ACAGTGTTTA 3180 

AGTGGGAAGT ATAAGTACTG GAGGTTAATT GTGGAGAAAA TCATTAAAGA AAAAATTTCT 32 40 

TCCTTACTTA GTCAAGAAGA GGAAGTCCTC AGTGTTGAAC AACTGGGTGG AATGACCAAT 3 3 00 

CAAAACTATT TGGCCAAAAC AACAAATAAG CAATACATTG TTAAATTCTT TGGTAAAGGG 33 60 

ACAGAAAAGC TTATCAATCG ACAAGATGAA AAGTACAATC TTGAACTACT AAAGGATTTA 342 0 

GGCTTAGATG TAAAAAATTA TCTTTTTGAT ATTGAAGCTG GTATCAAAGT AAATGAGTAT 3 4 80 

ATCGAATCTG CGATTACGCT TGATTCAACG TCAATCAAGA CCAAGTTCGA CAAAATTACT 354 0 

CCAATATTAC AAACT AT T C A TACGTCTGCT AAGGAATTAA GAGGAGAATT TGCTCCTTTT 3 600 

GAAGAAATCA AAAAATACGA ATCCTTGATT GAAGAACAAA TTCCTTATGC CAACTATGAA 3 6 60 

TCTGTTAGAA ATGCAGTCTT CTCCTTAGAG AAAAGACTGG CTG ACT T AGG TGTTGACAGA 372 0 

AAATCTTGTC ATATCGATTT GGTGCCTGAA AACTTTATCG AATCACCTCA AGGACGACTT 3780 

TATTTGATTG ACTGGGAATA TTCATCAATG AATGATCCAA TGTGGGATTT GGCTGCCCTC 3 84 0 

TTTTTAGAGT CTGAATTCAC TTCCCAAGAG GAAGAAACTT TCTTATCTCA CTATGAGAGT 3900 

GACCAAACAC CGGTTTCTCA TGAAAAGATT GCTATTTATA AAATTTTACA AGATACTATT 3 960 

TGGAGTCTAT GGACTGTCTA TAAGGAAGAG CAAGGTGAAG ATTTTGGTGA CTATGGTGTG 4 020 

AATCGTTACC AAAG AG CT AT TAAAGGTTTG GCTTCTTATG G AGGTT C AG A TGAAAAGTAA 4080 

AAACGGAGTT CCTTTTGGCC TTCTCTCAGG TATTTTCTGG GGCTTGGGTC TAACGGTTAG 414 0 

TG CT TAT AT C TTTTCGATTT T T AC AG ATTT GTCACCCTTT GTGGTGGCTG CAACTCATGA 42 00 
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111111 bAuL 


ATCTTTATCT 


TACTAGCTTT 


TCTCTTGGTA 


AAAGAAGGGA 


AAGTTCGCCT 


4260 


V_ 1 LAA1 1 1 1L 


■ II iii TV TV TV ill TV m fit 

TTAAATATTC 


/~* /^t tv tv m/^m/^ tv 

GCAATGTCAG 


TGTTATCATC 


GGAGCCTTGC 


TAGCAGGCCC 


4320 


1A1 CGGi A l G 


CAGGCCAATC 


TTTATGCAGT 


TAAGTATATC 


GGAAGTTCTT 


TAGCTTCATC 


4380 


1 G 1 ATCGGCT 


ATTTACCCTG 


CGATTTCAGT 


TCTATTGGCT 


TTCTTCTTTT 


TGAAGCACAA 


4440 


bAl 1 1 LGAAA 


AAT AC TGT AT 


TTGGGATTGT 


CTTGATTATT 


GGAGGGATTA 


TTGCTCAGAC 


4500 


/TT1 "A m TV TV /"*t m 

CTATAAGGTT 


T* Tk tk. * m i i mm -m 

GAACAGGTTA 


ATTCTTTCTA 


CATTGGGATT 


CTTTGTGCTT 


TGGTTTGTGC 


4560 


T AT TG C ATGG 


GGAAGTGAGA 


GTGTTCTTAG 


CTCTTTTGCC 


ATGGAAAGTG 


AATTGAGTGA 


4620 


AATCGAAGCC 


CTCTTAATCC 


GTCAAGTAAC 


TTCGTTCTTG 


TCCTATCTTG 


TGATTGTGCT 


4680 


CTTCTCTCAT 


CAGTCATTTA 


CTGCAGTAGC 


CAATGGACAA 


TTGCTAGGTC 


T C ATG AT TGT 


4740 


TTTTGCAGCC 


TTTGATATGA 


TTTCCTACTT 


GGCTTATTAT 


ATCGCTATCA 


ATCGCTTGCA 


4800 


ACCAGCCAAG 


GCTACAGGCT 


TGAACGTGAG 


CTATGTAGTA 


TGGACGGTCT 


TGTTTGCAGT 


4860 


TGTTTTCTTG 


GGTGCACCGC 


T AG AT ATGC T 


G AC CAT T ATG 


ACGTCACTTG 


TCGTCATTGC 


4920 


TGGAGTTTAT 


ATTATTATTA 


AAGAATAAAG 


GAGATTCGTG 


TG AAAG C CAT 


T AT CT T AG C A 


4980 


GCGGGATTGG 


GAACTCGCTT 


GCGTCCTATG 


ACTGAAAATA 


CCCCTAAAGC 


CTTGGTTCAG 


5040 


GTT AAT C AAA 


AACCTTTGAT 


TGAGTACCAA 


ATTGAGTTTC 


TCAAAGAAAA 


AGGAATCAAT 


5100 


GACATCATCA 


TCATTGTTGG 


T T AT CT T AAA 


GAACAATTCG 


ATT AC T TG AA 


AGAGAAATAC 


5160 


GGTGTTCGTC 


TCGTTTTCAA 


TGATAAATAC 


GCTGACTACA 


ATAACTTTTA 


CTCTCTCTAT 


5220 


CTTGTAAAAG 


AAGAATTGGC 


CAACAGCTAT 


GTT AT TG AT G 


CTGACAATTA 


TCTCTTTAAA 


5280 


tv tv m tv my^i mm 

AATATGTTCC 


GCAATGATTT 


GACACGTTCG 


ACTTATTTTA 


GTGTTTATCG 


TGAAGATTGT 


5340 


T\ /T/*"* TV TV /~1 TV TV m 

AC C AAC G AAT 


GGTTCTTGGT 


TTATGGAGAT 


GACTACAAGG 


TTCAAGACAT 


TATTGTTGAT 


5400 


AGCAAGGCAG 


GTCGCATCCT 


TAGTGGTGTA 


TCCTTCTGGG 


ATGCTCCAAC 


TGCAGAAAAG 


5460 


ATTGTCAGCT 


TTATCGACAA 


GGCTTATGTA 


AGTGGTGAAT 


TTGTTGATCT 


CTATTGGGAC 


5520 


AATATGGTTA 


AGGATAATAT 


CAAAGAGCTA 


GATGTCTATG 


TTGAAGAATT 


AGAAGGCAAT 


5580 


AGCATTTATG 


AGATCGATAG 


TGTCCAAGAC 


TATCGTAAAT 


TAGAAGAAAT 


TCTTAAAAAC 


5640 


GAAAATTAAA 


GATTCCAACA 


TCTGACAAAA 


TAGTCGGATG 


TTTTTTGATT 


TTTTACGAAC 


5700 


TTTTACGAAT 


AGATAGATGA 


GTAGAAAAAG 


AAATGGAGTT 


ATT T ATG AAA 


ATCACAAACT 


5760 


ATGAAATCTA 


TAAGTTAAAA 


AAATCAGGTT 


T G AC C AAT C A 


ACAGATTTTG 


AAAGTGCTAG 


5820 


AATACGGTGA 


AAATGTTGAT 


CAGGAGCTTT 


TGTTGGGTGA 


TATTGCAGAT 


ATCTCAGGTT 


5880 


GCCGTAATCC 


AGCCGTTTTT 


ATGGAACGTT 


ATTTTCAGAT 


AGACGATGCG 


CATTTGTCGA 


5940 
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AAGAGTTTCA 


AAAATTTCCA 


TCTTTCTCTA 


TTTTAGATGA 


CTGTTATCCT 


TGGGATTTGA 


6000 


GTGAAATATA 


TGATGCGCCT 


GTACTTTTAT 


TTTACAAGGG 


AAATCTTGAC 


CTCCTGAAAT 


6060 


TCCCGAAGGT 


AGCGGTCGTG 


GGCAGTCGTG 


CTTGTAGCAA 


AC AGGG AG CT 


AAGTCAGTTG 


6120 


AAAAAGTCAT 


TCAAGGCTTG 


GAAAATGAAC 


TGGTTATTGT 


CAGTGGTCTG 


GCCAAGGGCA 


6180 


TTGACACAGC 


AGCTCATATG 


GCAGCTCTTC 


AGAATGGCGG 


AAAAACCATT 


GCAGTGATTG 


6240 


GAACAGGACT 


GGATGTGTTT 


TATCCTAAAG 


CCAATAAACG 


CTTGCAAGAC 


TACATCGGCA 


6300 


ATGACCATCT 


GGTTCTAAGT 


GAATATGGAC 


CTGGTGAACA 


ACCTCTGAAA 


TTTCATTTTC 


6360 


CTGCCCGTAA 


TCGCATCATT 


GCTGGACTTT 


GTCGTGGTGT 


GATTGTAGCA 


GAGGCTAAGA 


6420 


TGCGTTCAGG 


TAGTCTCATT 


ACGTGTGAGC 


GAGCAATGGA 


AGAAGGACGC 


GATGTCTTTG 


6480 


CTATTCCTGG 


TAGCATTTTA 


GATGGACTAT 


CAGACGGTTG 


CCATCATTTG 


ATTCAAGAAG 


6540 


GAGCAAAATT 


GGTCACCAGT 


GGGCAAGATG 


TTCTTGCGGA 


ATTTGAATTT 


TAAAAATGAC 


6600 


C T AAGCT AG A 


ATTCTAAGAA 


AAAAT C AATT 


TTAAGAGAAA 


ATGAACCCAA 


CATTTCCATA 


6660 


AT AAAACG C A 


TATTAGCAAG 


TTTTTAACAC 


TTGATAATAT 


GCGTTTTTTC 


TAAGTGGATT 


6720 


AGT AG AG TAG 


AGGATTTTTC 


TCATATAATA 


CTCTTCGAAA 


ATCTCTTCAA 


ACTACGTCAG 


6780 


CTTCCATCTG 


CAACCTCAAA 


ACAGTATTTT 


GAGCgaCTtC 


GTCAGTCTTA 


TCTACAACCT 


6840 


CAAAGCAGTG 


CTTTGAGCAA 


CCTGTGGCTA 


GCTTCCTAGT 


TTGCGCTTTG 


ATTTTCATTG 


6900 


AGTATAAGGG 


AAAGTATAGT 


GAATTGAAAT 


AAGATGTGAA 


CAACTCTATC 


AGGAAAGTCA 


6960 


AATTAATTTA 


TAGAAATATT 


TTAGCAGCCA 


AGGTGTACTG 


TTATAGATTC 


AAT T AC ACT A 


7020 


TAATTTAGTG 


TAATTGAGAA 


AGGAGAAATG 


ATTGTGATTG 


ATGTTGGCTA 


GGTTATGTTC 


7080 


AATGATTCCT 


ACCGTCTCAA 


ATCTTGTCAG 


TAAGGAAAAA 


TAAATTCTTC 


AAAAGTAGAG 


7140 


ATTACAAGGC 


TTGTTTAAGA 


AAGAATTCAA 


AGACCTTGAC 


AAATAAAAAT 


AAAATGGTTA 


7200 


TTATAAAAAA 


TGGTCTGAAA 


TAGATGATGA 


TACTTTTCGA 


AAATCTCTTC 


AAATACGTCA 


7260 


GCTCAGCTTT 


GCCTTGCTGT 


GTTTTGAGCA 


AGCTACGGTT 


AGCTTCCGAG 


TTTGATTTTC 


7320 


ATTTACTAGA 


AATGAAACT G 


AT GAG AG AT A 


TCAGTAGACA 


TTTGAGTCAG 


GATATTATGG 


7380 


AAAATGATAA 


AAAGAGCTCG 


TGAGATTGGC 


ATATCAGACT 


ACTAAAGTAT 


TGAGTTTGTT 


7440 


AGGATTTTAG 


CGACTAGTTA 


GCTGGGAAAG 


GAAGATATTT 


GTGACAAATA 


ATAAACTGTA 


7500 


TTCGTTGATA 


GAATTTAGAA 


ATAAAATATA 


TGAAGAATTA 


GAACTTTCCA 


GAAGTGATTT 


7560 


AGCGATTTTA 


CTATGTGCCA 


TGCTTATCGC 


CTCTATCGGA 


TTAAATATGG 


ATTCGACTCC 


7620 


CGTGATTATT 


GGAGCCATGT 


TAATCTCTCC 


TTTGATGACA 


CCTATTCTGG 


GAGTGGGGCT 


7680 


CTCTCTAGCT 


ATATTTGATT 


TTAAATTGTT 


AAGAAAATCT 


TT T AAAAT AT 


TAGCTATTCA 


7740 
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AATTCTTGCC AGTCTAATAG CTTCAACACT TTATTTTTAT CTTTCTCCCA TTTCGTATGC 7800 

TAGTTCGGAG ATTGTTGCTA GAACCTCTCC GACTATTTGG GATGTTCTCA TTGCTTTTGT 78 6 0 

AGGAGGGATA GCAGGTATCA TTGGTGCTAG GAAAAAAGAG AC 7 902 



(2) INFORMATION FOR SEQ ID NO : 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18627 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
GAAGTTGAAA TGGCCAGCTG ATGAGCAATA TCGGTCATAG AAATCTTCTC AATCAACTTT 60 
TGCGCAATTT TTTGGTTGAT AATACGAGGA ATTTGGTGAT TTTTCTTGAC GATAGAAGTT 12 0 

TCAGCGACCA TCATTTTTGA AC AGT G AT AG CACTTGAAAC GACGCTTTCT AAGTAGAATT 180 
CTAGTAGGCA TACCAGTTGT CTCAAGGTAA GGAATCTTAG ACGGTTTTTG AAAGTCATAT 24 0 

TTCTTCAATT GGTTTCCGCA CTCAGGGCAA GATGGGGCGT CGTAGTCCAG TTTGGCGATG 3 00 

ATTTCCTTGT GTGTATCTTT ATTGATGATG TCTAAAATCT GG AT AT TAG G GTCTTTAATG 3 60 

TCTAGTAATT TTGTGATAAA ATGTAATTGT TCCATATGAA TCTTTCTAAT GAGTTGTTTG 42 0 

GTCGCTTTTC ATT AT AGGT C ATATGGGACT TTTTTTCTAC AATAAAATAG GCTCCATAAT 4 80 

AT CT AT AAGG GATTTACCCA CTACAAATAT T AT AGAGC C A AAAATCCTTT GTTTACTAAA 54 0 

CAAGGGATTT TTCTTTTGTC TCTGCTCCTT T TTTG AT AT A ATAGTTCTAT GTTAAAATCA 60 0 

GAAAAACAAT CACGTTATCA AATGTTAAAT GAAGAATTGT CCTTCCTATT GGAAGGCGAA 6 60 

ACCAATGTTT TGGCTAATCT TTCCAACGCC AGTGCTCTCA TAAAATCACG TTTTCCTAAT 720 
ACCGTATTTG CAGGCTTTTA TTTGTTCGAT GGAAAGGAAT TGGTTTTAGG CCCCTTCCAA 7 80 

GGAGGTGTTT CCTGCATCCG TATTGCACTA GGCAAGGGTG TTTGTGGTGA GGCAGCTCAC 84 0 

TTTCAGGAAA CTGTTATTGT TGGAGATGTG ACGACCTATC TCAACTATAT TTCTTGTGAT 900 
AGT CT AG C T A AAAGTGAAAT TGTGGTGCCG ATGATGAAGA ATGGTCAGTT ACTTGGAGTT 9 60 

CTGGATCTGG ATTCTTCAGA GATTGAGGAT TACGATGCTA TGGATCGAGA TTATTTGGAA 1020 

CAATTTGTCG CTATTTTGCT T G AAAAG AC A GCATGGGACT TTACGATGTT TGAGGAAAAA 1080 

TCTTAATGTA TCAAGCACTT TATCGAAAAT ATAGAAGTCA AAACTTCTCC CAGTTAGTTG 1140 

GTCAAGAAGT TGTGGCTAAG ACTCTTAAAC AAGCGGTGGA G C AAG AG AAA ATAAGTCACG 12 00 
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CTTATCTTTT TTCTGGTCCT CGTGGAACGG G AAAAAC C AG TGTTGCTAAA ATCTTTGCCA 12 60 

AGGCTATGAA CTGTCCCAAT CAAGTGGGTG GCGAACCTTG CAATAACTGC TATATTTGTC 13 20 

AAGCAGTGAC GGACGGTAGT TTAGAAGATG TCATTGAAAT GGATGCAGCT TCTAATAATG 1380 

GGGT AG AT G A AATTCGCGAA ATTCGTGATA AATCTACCTA TGCGCCTAGC CTTGCTCGTT 1440 

ATAAGGTTTA TATCATAGAT GAGGTTCACA TGCTGTCTAC AGGGGCTTTT AATGCCCTCC 1500 

TAAAGACGCT GGAAGAACCA ACACAGAATG TAGTCTTTAT TTTGGCCACT ACTGAATTGC 1560 

ACAAGATTCC TGCTACTATT CTATCCCGTG TGCAACGTTT TGAGTTTAAA TCAATTAAGA 1620 

CACAGGATAT TAAGGAACAT ATTCACTATA TCTTAGAAAA AGAAAATATC AGTTCTGAAC 1680 

CAGAGGCTGT GG AAAT CAT T GCCAGACGGG CGGAAGGTGG AATGCGGGAC GCCTTGTCTA 1740 

TTTTGGATCA AGCCCTGAGT TTGACACAGG GAAATGAGCT GACGACTGCT ATCTCTGAAG 1800 

AAAT T AC T GG CACCATTAGC CTATCAGCCT TGGATGATTA TGTGGCGGCC TTGTCTCAAC 18 60 

AGGATGTTCC CAAAGCTTTG TCTTGCTTGA ATCTTCTTTT TGACAATGGT AAGAGCATGA 192 0 

CTCGTTTTGT GACCGATCTT TTGCACTATT TAAGAGACTT GTTAATTGTT CAAACAGGGG 1980 

GAGCAAATAC TCATCATAGT TCAGTCTTTG TAGAAAATTT GGCACTTCCT CAAAAAAATC 204 0 

TGTTTGAAAT GATTCGCTTA GCAACAGTGA GTTTAGCAGA TATTAAGTCT AGTTTGCAAC 2100 

CCAAGATTTA TGCTGAAATG ATGACCGTCC GTTTGGCGGA AATCAAGTCC GAACCAGCTC 2160 

TATCAGGAGC GGTTGAAAAT GAAATTGCTA CGCTGAGACA GGAAGTTGCC CGTCTCAAAC 2220 

AAGAGCTTTC TAATGTAGGT GCGGTTCCTA AACAAGTTGC ACCAGCTCCT AGTCGACCAG 22 80 

CTACGGGCAA AACAGTCTAT CGTGTCGATC GCAATAAAGT GCAATCTATC TTACAAGAGG 234 0 

CCGTCGAAAA TCCTGATTTA GCACGTCAAA ATTTAATTCG TTTGCAGAAT GCCTGGGGAG 2400 

AGGTAATTGA AAGTCTAGGT GGGCCGGACA AGGCTCTGCT AGTTGGTTCT CAACCGGTTG 2460 

CTGCCAATGA ACACCATGCT ATTCTTGCTT TTGAGTCTAA CTTCAATGCT GGTCAAACTA 252 0 

TGAAACGAGA CAATCTCAAT ACCATGTTTG GTAATATCCT CAGTCAGGCG GCAGGTTTTT 2 58 0 

CACCTGAGAT TTTAGCTATT TCCATGGAGG AATGGAAAGA AGTTCGCGCA GCCTTTTCAG 2640 

CCAAAGCCAA ATCTTCTCAA ACTGAAAAAG AAGTAGAAGA AAGCCTGATT C C AG AAGGAT 2700 

TTGAATTTTT GGCTGATAAA GTGAAGGTAG AGGAAGACTA AAGAAAGATT TCATGATACA 27 60 

ATAAGTTTAT GAATAAACAA CAATTTATTA TTATGGCGCT GTTTACAGCT GCTG AG AC C T 282 0 

ATTTTTTCAA TGAAGCCTGG ATGACTGGCC GCTATATTAT GGCAGCCTTT TGGGCAATTT 2 880 

TACTCTTTAG AAATTTCCGA GTCAGTTATG TGATGGGCAA AATCGTTGAT GTCATCGATC 2940 

AGCATTTTAA TAGGAAAGAC TAGCCCTCAG CTTCCAGACA AAAT C AAAGC CTTTTAGGCT 3000 
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TTTTTTTGTT ATACTAGAAA AGTATATTTA TAGAATTTTT GCTCTATTTC TGGGGAAATC 3 0 60 

AGACGTTTTT CTAGTAAGTA CTGTAAAAGT TTTGAAAAAG AAAGGAACTA TCATGTCAGT 312 0 

ATTAGAGATC AAAGATCTTC ACGTTGAGAT TGAAGGAAAA GAAATTTTAA AAGGGGTTAA 3180 

CCTGACCCTG AAAACAGGAG AAATTGCCGC TATCATGGGA CCAAATGGTA CAGGTAAATC 3 24 0 

GACTCTTTCT GCCGCTATCA TGGGAAATCC AAACTATGAA GTAACTAAAG GTGAAGTTTT 3 3 00 

GTTTGATGGC GTAAACATCC TTGAGTTGGA AGTGGATGAG CGTGCGCGTA TGGGACTTTT 3 3 60 

CCTTGCTATG CAATACCCAT CAGAAATCCC TGGAATTACC AATGCTGAGT TTCTTCGTGC 3 42 0 

CGCTATGAAT GCGGGTAAAG AAGATGATGA GAAGATTTCA GTTCGTGAGT TTATTACTAA 3480 

GCTAGATGAA AAAATGGAAT TGCTCAACAT GAAAGAAGAA ATGGCAGAGC GTTACCTCAA 3 540 

CGAAGGCTTC TCTGGTGGTG AGAAAAAACG CAATGAAATT CTTCAACTTT TGATGTTGGA 3 6 00 

GCCAACATTT GCTCTTTTGG ACGAGATTGA CTCAGGTCTT GATATTGACG CTCTTAAAGT 3 660 

TGTGTCTAAA GGTGTCAATG CCATGCGTGG TGAAGGTTTT GGTGCTATGA TCATCACTCA 3 72 0 

CTACCAACGT CTTTTGAACT AT ATC AC AC C TGATGTGGTA CACGTGATGA TGGAAGGTCG 3780 

TGTTGTCCTT TCTGGTGGTC CAGAATTGGC TGCGCGTTTG GAACGTGAAG GATACGCAAA 3 840 

ATTAGCTGAA GAACTTGGCT ACGACTACAA GGAAGAATTG TAATTCCCTC GTATCTTTTA 3 900 

GGAGAAGTAA ATGACTAGAG AAAATATTAA ACTTTTTTCA GAAATGCACG CTGAACCAAG 3 960 

CTGGTTGGCT GATCTCCGTC AAAAAGCTTT TGACAAGATT GAGACTTTGG AATTACCAGT 4020 

TATTGAGTGT GTCAAATTCC ACCGTTGGAA TCTGGGTGAT GGAACGATTA C AG AAAAT G A 4 080 

GCCATCAGCA AATGTTCCAG ATTTCACAGC TTTAGATCAT CACTTGAAGT TGGTGCAAGT 4140 

AGGAACTCAA ACTGTTTTCG AACAAACTCC AGTTGAGTTA GCTGAACAGG GTGTTGTCTT 4 200 

CACAGACTTT CACTCAGCTT TAGAAGAAAT TCCAGAGCTG ATCGAAGAAT TCTTCATGTC 42 60 

ATCTGTTAAG TATGATGATG ACAAGTTGGC GGCTTACCAC ACAGCTTACT TTAACAGTGG 432 0 

TGCTGTACTC TAT ATT CC AG ATAACGTAGA AAT C AC AG AG CCAATTGAAG GAATTTTCTA 4 3 80 

CCAAGATAGC GATAGCAATG TGCCGTTTAA CAAGCATATT ATGATTATCG TTGGTAAAAA 4440 

TTCTAAGATT AGTTATCTGG AGCGTTTAGA GTCACGCGGT GAAGGAAGTG ACAAAGCAAC 4500 

TGCCAATATC ACAGTGGAAG TGATTGCACG TTCTGGTGCG CAAGTCAAGT TTGCTGCTAT 45 60 

CGACCGTCTA GGTGAAAACG TCACTGCCTA CATTAGCCGT CGTGGTAAAT TAGGCAACGA 4620 

TGCAAGTATT GACTGGGCTA TCGGTGTCAT GAACGAAGGA AATGTCGTTG CTGATTTTGA 4 6 80 

TAGTGACTTG ATTGGTAATG GTAGCCATGC TGACCTCAAG GTTGTAGCTC TTTCAAGTGG 474 0 
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TCGTCAGGTA CAAGGGATTG ATACTCGTGT AACTAACTAT GGCTGCAACT CAATCGGAAA 4800 

CATTCTACAA CATGGGGTTA TCCTTGAAAA AGCAACTTTG ACT T T C AATG GTATCGGCCA 4 8 60 

CAT C AT C AAG GGTGCTAAGG GAGCAGATGC GCAACAAGAG AGCCGTGTTC TCATGCTTTC 4920 

AGACCAAGCG CGTTCAGATG CTAACCCAAT TCTTTTGATT GATGAAAATG ACGTAACTGC 4980 

AGGCCATGCA GCCTCTATTG GTCAGGTAGA T C C AG AAG AT ATGTACTACC TCATGAGTCG 5040 

TGGCTTGGAT AAGGCAACTG CAGAGCGTTT GGTTGTTCGT GGTTTCCTTG GATCTGTTAT 5100 

CGTGGAGATT CCAGTCAAGG AAGTTCGTGA TGAAATGATT GCAACTATCG AAGAGAAATT 5160 

GTCAAAACGC TAAGGGGCAG CCTATGTTAG AT GT AG AAG C GATTCGCAAG GATTTTCCAA 522 0 

TTTTAGATCA GATTGTCAAT GATGAACCTC TGGTCTATCT GGACAATGCT GCGACGACAC 5280 

AAAAACCACT AGTAGTTCTG AAAGCTATTA ACAGCTACTA TGAGCAGGAC AATGCCAATG 5 34 0 

TTCACCGTGG TGTCCATACC TTAGCGGAAC GAGCGACAGC TTCTTATGAA GCTGCTCGTG 5400 

AAACCATTCG TAAGTTTATT AATGCAGGCT C T AC AAAGG A AGTTCTCTTT ACCAGAGGAA 54 60 

CGACAACCAG CCTTAACTGG GTGGCACGCT TTGCTGAGGA AATTCTCACT GAGGGAGACC 552 0 

AGGTCTTGAT TTCAGTAATG GAACACCATT CTAATATCAT TCCATGGCAG GAAGCTTGTC 5580 

GAAAGACTGG AGCAGAGCTT GTCTATGTCT ATCTTAAAGA CGGTGCCTTG GATATGGAGG 5 640 

ATTTGCGAGC T AAAT T G ACT GATAAGGTTA AATTTGTTTC CCTAGCTCAT GCCTCCAATG 5700 

TTCTTGGTGT GGTCAATCCG AT C AAGG AAA TCACTCAATT AGCCCACCAA GTTGGGGCAA 57 60 

TTATGGTAGT GGATGGTGCT CAATCTACAC CTCATATGAA GATTGATGTC CAGGACTTGG 5 82 0 

ATCTGGACTT TTTCGCCTTT TCGGGTCACA AGATGGCTGG TCCGACTGGT ATCGGTGTCC 588 0 

TTTACGGCAA AGAAAAGTAT CTTGAGCAAA TGTCTCCAGT AGAATTTGGC GGCGAGATGA 5 94 0 

TTGATTTTGT CTACGAGCAA TTTGCTAGTT GGAAGGAATT GCCTTGGAAA TTTGAGGCTG 6000 

G AACG C C AAA TATGGCAGGA G C T ATT GG AC TTGCGACTGC AGTTGATTAT CTGGAAAAGA 6060 

TTGGTATGGA TGCCGTTGAA GCTCATGAAC AGGAATTGAT TGCGTACGTC TATCCAAAAC 612 0 

TGCAGGCAAT TGAGGGATTG ACCATTTACG GTTCTCAGGA TTTGGCTCAA CGTTCGGGTG '5180 

TTATTGCCTT TAACCTAGGT GATCTCCATC CTCACGATCT TGCGACGGCT CT GG AT TAT G 62 40 

AAGGAGTGGC TGTTCGTGCT GGTCACCATT GTGCGCAACC CTTGCTTCAG TATTTGGAAG 6300 

TCCCAGCAAC AGCTCGTGCA AGTTTTTATA TCTACAATAC CAAGGCAGAT TGCGACAAAC 63 60 

TAGTCGATGC CCTACAAAAG ACAAAGGAGT TTTTCAATGG CACTTTCTAA ACT AG AT AG C 6420 

CTTTATATGG CAGTGGTAGC AGACCATTCG AAAAATCCAC AT C AC C AAGG GAAGTTAGAA 64 8 0 

GATGCTGAGC AAATCAGTCT CAACAATCCG ACTTGTGGGG ATGTCATCAA CCTCTCTGTC 6540 
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AAGTTTGATG CAGAGGACCG TTTGGAAGAT ATTGCTTTTC TAAATTCAGG ATGCACGATT 6 600 

TCAACTGCTT CTGCTAGTAT GATGACAGAT GCCGTTTTAG GAAAAACCAA ACAAGAAATT 6660 

TTAGAACTGG CGACTATTTT TTCTGAAATG GTTCAAGGGC AAAAAGATGA GCGTCAAGAC 672 0 

CAACTTGGAG ACGCGGCATT CTTGTCAGGT GTTGCCAAAT TCCCTCAAAG AATCAAGTGT 67 80 

GCAACCCTAG CTTGGAATGC CCTTAAGAAA AC AAT T G AAA ATCAAGAAAA ACAGTAAGAC 6840 

AAGTTTCTTT TGTCTTATGA ATTATTAGAA ATGAAGAAAG AAAGG AT AC T ATGGCTGAAG 6900 

AAAGAGTAGA ACCAAAACCA ATTGACCTTG GTGAATATAA ATTTGGTTTC CATGACGATG 6960 

TAGAGCCTGT CTTATCGACA GGAAAAGGAC TCAACGAAGG TGTTATTCGT GAATTATCTG 702 0 

CTGCTAAGGG TGAGCCTGAG TGGATGTTGG AGTTCCGTTT GAAGTCTTAT GAAACCTTCA 7080 

AAAAAATGCC CATGCAAACT TGGGGAGCAG ACTTGTCAGA GATTGACTTT GATGACTTAA 7140 

TCTACTACCA AAAACCATCT GACAAACCAG CCCGTTCTTG GGATGATGTA CCTGAAAAGA 72 00 

TTAAAGAAAC CTTTGAACGT ATCGGGATTC CAGAAGCTGA ACGTGCTTAT TTAGCAGGGG 72 60 

CTTCTGCCCA GTACGAGTCA GAAGTGGTTT ACCACAACAT GAAGGAAGAG TTCCAAAAAT 73 2 0 

TAGGTATTAT CTTTACAGAT ACAGATTCCG CACTCAAGGA AT ACC C AG AC TTATTTAAAC 73 8 0 

AATACTTTGC GAAGTTGGTA CCGCCGACAG ATAACAAGTT GGCAGCCCTC AACTCAGCAG 7440 

TATGGTCGGG TGGAACTTTT ATCTACGTGC CAAAAGGTGT CAAGGTAGAT ATTCCACTTC 7 500 

AAACTTATTT CCGTATCAAT AACGAAAATA TAGGTCAGTT CGAACGTACC TTGATTATCG 7 5 60 

TTGATGAGGG AGCAAGCGTC TACTACGTAG AAGGATGTAC AGCACCAACA TATTCAAGCA 7 62 0 

ATAGCTTACA CGCTGCCATT GTAGAAATTT TTGCTTTGGA CGGAGCTTAT ATGCGTTATA 7 680 

CAACTATCCA AAACTGGTCT GATAACGTCT ATAACTTGGT AACAAAGCGT GCTAAGGCTC 774 0 

AAAAGGATGC CACTGTTGAG TGGATTGATG GAAACTTGGG TGCCAAAACG ACTATGAAAT 7 8 00 

ATCCATCTGT TTACCTTGAT GGAGAAGGAG CGCGTGGTAC CATGCTCTCT ATCGCCTTTG 7860 

CTAATGCAGG GCAACACCAA GACACGGGTG CTAAGATGAT TCACAATGCT CCACATACCA 79 2 0 

GCTCGTCTAT TGTGTCTAAA TCCATCGCTA AAGGTGGAGG AAAGGTTGAC T AC CGTGG AC 7 980 

AAGTCACCTT TAACAAGAAC TCTAAGAAAT CTGTTTCCCA CATTGAATGT GATACCATTA 8040 

TCATGGATGA CTTGTCAGCA T C AG AT ACT A TTCCATTTAA TGAAATTCAC AACTCGCAAG 8100 

TGGCTTTGGA ACACGAAGCC AAAGTATCTA AGATTTCAGA AGAGCAATTG TATTATCTCA 8160 

TGAGCCGTGG ATTGTCAGAA TCTGAGGCAA CTGAAATGAT TGTCATGGGA TTTGTAGAAC 822 0 

CCTTTACAAA AGAACTTCCA ATGGAATACG CAGTTGAGCT GAACCGCTTG ATTAGCTATG 82 80 
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AAATGGAGGG AT C AGTTGG A TAAAATTTGA TTTTATACTC TTCGAAAATC TCTTCAAACC 834 0 

ACGT C AG C AT CGCCTTACCG TATGTATGGT TwCTGAtTCG TCAGTTTCAT CTACAACCTC 8400 

AAAACAGTGT TTTGAGCAAC tGCGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTGAGTATT 84 60 

AG AT TT AC TC AAAATCAAGG ATTTTGAAGA TGAACTTGTA TCAAAAAATC GCGGTTTAAA 852 0 

ATCGCGATTT TTTATAATTT CTCGTTAACA AAGCGGACAA ACTGATTCCA CCAAACTTTT 858 0 

AAGAAGAAGG CTTTTTCAAT TTTCTTGTCT GCTACCATTT CGAAACTAGG GCGCTCTGTG 8640 

GTGATGTAAC CTTGACCAAT CAAGTCCTTG TCTTCATAAG TCAAATGGCC AACCACTGTT 8700 

CCAGCTTCAA GTGGTGCTGG GATTGCTTTG GAATCAGGTG TGAATTGAAC AGATTGGGAA 8760 

GATTGATTCC CAACACGTTC GATTAGATAG ATATCCTCTG GAGCCACTGC AGTT ACT GT A 8820 

TCTTCTTTTC CATCTTGTAC AGGGGCTTTG CTATCTTGAT AGGCATCGCC TTGTTGAACG 8880 

ATTTTGCGAA GTGTAAATGT AGAAGAAATA TAATCCATTA GGGAAGATGT AGCTGTAAAT 8940 

CGAGCGTAAG GATTATTGTC TTGATGATCT GCATTTAAAA CAACTGTGAT GACTCTCATG 9000 

CCTTTTTCGA CAGTAGTACC AACAAAAGAC TCTCCAGCCT TATCTGTTGT TCCTGTTTTT 9060 

AGCCCATCAA AACCACCACG GTAAGCAGGC ATACCTTCTA ACATGTAGTT GGTTGAAGTG 912 0 

ATTGTCATCC CAGCAAAAGT AGAAGAAGGT TTTTTGGTGA TTTCTAAGAC TTGTGGGTAT 9180 

TTTTTGATGA GGTTGCGAGC AACGATAGCG ACATCATAAG CACTAAGCTT ATTTTCCTCA 924 0 

TCTTTTTTAG AAC CTGGGT A AATGTTATCC CCTAGAGTTT CATTGTTAAG ACCTGTCGTA 9300 

TTGACAACAG TGGCATCCTG AATTCCCCAT TCCAAGAGTT TTGCCCGCAT C AT ATCG AC G 93 60 

AAATCTTTTT CTGAGCCAGC AATTTTCTCA GCTAGGGCAA TAGCGGCGCT GTTGGCACTA 942 0 

GATACCAGAG TTGCTTCAAG CAACTCTTCG ACAGTATAAT TACGGGCCTC CATAGGAATA 94 8 0 

TTACTGGCTT CAGAATTTGT CGTCAATTGA TAAGGATAAT C AG AAAT AT C TACAGGAGTG 9540 

GAGAGGGTAA TACTTCCGTT TTCCAAAGCT TCATAGACCA GATAAACAGT AATCAATTTT 9600 

GTTATGGAAG CAATTTCGAC AGGTTGCGTT GCATCCTTCT CATAGAGAAT TTTACCAGTA 9 6 60 

TTTGCCTCAA CAGCAATCGC ATGTTTAGCG GCAATGGTAA AATCTTGAGC AACAGCAGTA 9720 

GAAGCACCCC CTAAAAGAGA GACAGTTAAC AAAGTTAAAA ATATTTTTTT CATAGTAGTC 97 80 

TTATTCTATC ATAAAGAAAA AAAATATTCT TGCTTTAATA ATTCATCTGT TAAGCTTTTT 9 84 0 

GAAAATATGG TAAAATAAAG TAAGGGAGGT AACTCATGTT TCGTAGAAAT AAATTATTTT 9900 

TTTGG AC C AC AGAAATTTTA CTCTTAACCA TCATCTTTTA CCTATGGAGA CAGATGGGGT 99 6 0 

CTTTGATTAA CCCTTTTGTT AGCGTGCTTA ATACAATTAT GATTCCATTT TTATTAGGGG 10020 

GCTTTTTTTA TTATTTGACA AACCCTATTG TTACTTTCTT AAATAAAGTC TGTAAACTCA 10080 



WO 98/18931 



PCT/US97/19588 



825 

ATCGTTTGCT TGGTATTTTA ATTACCTTGT GTACTTTGGT CTGGGGAATG GTCATAGGTG 1014 0 

TTGTCTATCT CTTACCTATT TTGATTAATC AGTTATCTAG TTTGATTATA TCTAGTCAAA 10200 

CTATTTATAG TCGAGTACAA G ACT T AAT C A TAGACTTATC TAATTATCCT GCGCTCCAGA 10260 

ATTTGGATGT AGAAGCTACA ATTCAGCAGT TAAACTTATC CTATGTTGAT ATTCTTCAAA 10320 

ATATCCTAAA TAGCGTATCA AATAGTGTGG GGAGCGTCTT GTCAGCTCTT ATCAGTACTG 103 8 0 

TTTTGATTTT GATTATGACT CCAGTTTTTT TGGTTTATTT C T TAT TAG AT GGACATAAAT 10440 

TCTTGCCCAT GCTTGAAAGA ACGATTCTAA AGAGGGATCG CTTGCATATT GCAGGCTTAT 10500 

TAAAGAATTT AAATGCGACG ATTGCTCGCT ATATTAGTGG AGTTTCGATT GACGCAATCA 105 60 

TTATAGGTTG TTTGGCTTAT ATTGG CT AT A GTATTATTGG TTTAAAATAT GCTTTAGTTT 10620 

TTGCCATTTT TTCTGGTGTA GCCAATTTAA TTCCTTATGT GGGGCCAAGT ATTGGTTTGA 10 680 

TTCCTATGAT CAT C GC AAAT ATATTCACTG ATCCCCATAG ACTGCTGATT GCAGTGATTT 1074 0 

ATATGCTTGT TGTTCAGCAG GTAGATGGCA ATATCTTATA TCCTCGAATC GTAGGAAGTG 10800 

TTATGAAGGT T CAT C C AAT C AC GAT T T TAG TTTTACTTTT GTTGTCAAGC AATATCTATG 1086 0 

GTGTAGTTGG AATGATTGTC GCAGTGCCAA C CT ATT C TAT CTTGAAAGAA ATTTCTAAGT 1092 0 

TCTTATCCCA TTTGTATGAA AATCATAAAA TAATGAAAGA ACGAGAAAGA G AAT TAG C T A 10980 

AGTAAAAGTC AGGAGAACCC TGATTTTTCT TTACTGGAAG TGGCCTTTAG AT T AG AAG AC 11040 

TGAAAATAAG TTAAAGTCTT AAACTAATTT TCACAGCTAA GAATAGTAGA AGTTAATCTG 11100 

AT AAAAAT CG AAAAAACCAG TGGAATTCTG TGTCAGGGTA AGTTCCACTG GTTTTCATAG 11160 

TCTATTAAAG TTCGAATGAA ACCTATTTAT AGTAGATTGA AACTAGAATA GTACACCTCT 1122 0 

AATTCTAAAA CATTGTTAGA AATCGATTTG ACTGTCCTGA TCTATTCGTT CTATTCTTAT 112 80 

TTCATTTTAC TATATTTTGG TGCAATAAGT GAAAAGTAGT C C G AAT AAT A TAAGGATTGA 1134 0 

TTTTATAGTT TTTAAACTCA AATGAATTGA AATAAAGAGA GTACGAAAAT TCTCATCTGA 114 00 

AAGTATTTTA G AAT AAT T C T CTTCGTGAAT TTCTTCAAAA CAGATAGCTT CATCTTAGGT 114 60 

ATGTGATTTC TTTTTGCATT TTTGAGTTAG ATAAGGTATA ATGATTTTAT TGTCTTTTGG 1152 0 

GGTCGTTACG GAT T CG AC AG GCATTATGAG GCATATTTTG CGACTCGTGT GGCGACGTAA 115 8 0 

ACGCTCAGTT AAATATAACT GCAAAAAATA ACACTTCTTA CGCTCTAGCT GCCTAAAAAC 1164 0 

CAGCAGGCGT GACCCGATTT GGATTGCTCG TGTTCAATGA CAGGTCTTAT TAT T AGCG AG 1170 0 

AT ACG AT TAA GCCTTGTCTA GCGGTTTGAT AAGAGATTGA TAGACTCGCA GTTTCTAGAC 117 60 

TTGAGTTATG TGTCGAGGGG CTGTTAAAAT AATACATAAC CTATGGTTGT AGACAAATAT 1182 0 
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GTTGGCAGGT GTTTGGACGT GGGTTCGACT CCCACCGGCT C C ATT ATT C C TTTGCATTCT 1188 0 

TTTGCATTCC TTGGTAAAAC GTTGTTAAAT CAACGTTTTT TATTTTTATC TTTGGTATTC 11940 

CTTTGCATTC TTTTGCTAAA AAGGGAGTCA CAAACAGACC CTATTTTAAA AAAGGATAGA 12 000 

AAAAAGGATA CAACATTTGT CGCATCCTAA AAATAATCTT TTTTCGACGG AAGACATGGG 12060 

ATTCGAACCC ACGCACGCTA TTACACGCCT ACCGCGTTTC CAACACGGCC TCTTAAGCCT 1212 0 

CTTGAGTAAT CTTCCAATAC T T ACT C AAAT AGTCTACCAT AAAGGCTCTT ATCTTGCAAT 12180 

AAAAATTCTA GAAATAAGAA AAATGATAGA TTTTGAAAGA AAATGATAAA AAATGCTTGA 1224 0 

CTTCGAAAGA AAGTATGATA GAATGAATAG TGTAAACGAT AACAGGAGGT GATTCAGTGT 12 3 00 

TAAAAACAGA ACGTAAACAA CTAATTTTAG AGGAGTTAAA TC AAC AT CAT GTAGTTTCTC 123 60 

TAGAAAAATT AGTTAGTTTG C T AG AAACGT CAGAATCAAC GGTTCGAAGA GACTTGGATG 1242 0 

AGTTGGAAGC GGAAAACAAG CTTCGTCGTG TGCATGGTGG AGCAGAACTC CCCTACTCCT 12480 

TACAGGAAGA AGAAACCATT CAAGAAAAAT CTGTCAAAAA CCTTCAAGAA AAGAAATTGC 12 54 0 

TGGCTCAGAA AGCAGCCTCT CTCATTAAAG AAAAAGATGT CATCTTTATC GATGCTGGAA 12600 

CAACAACTGC TTTTTTGATT CATGAATTGG TCAATAAGAA TGTTACAGTT GTGACCAACT 12 6 60 

CCATTCACCA TGCCGCTCAG TTGGTTGAAA AGCAGAwTCC AACTGTCATG GTTGGAGGAA 1272 0 

ACGTCAAGAC GG CG AC AG AT GCTAGTATCG GGGGCGTTGC TCTTAACCAG ATTAACCAAT 127 80 

TGCACTTTGA CCGTGCCTTT ATCGGAATAA ATGGTGTTGA CGATGGCTAT TATACGACTC 12 84 0 

CTGATATGGA GGAGGGAGCT GTGAAAAGAG CTATTTTGGA GAATGCCAAG C AG AC CT AC G 12 900 

TCTTGGTGGA TTCGTCAAAA AT T GG AC AAA CTTGCTTTGC CAAGGTAGCC CCACTCAAAC 12 9 60 

GCGCTATCGT TATCACTAGT CAAGGGCATG AGCTCTTGCA GGTTATTAAG GAGAAAACGG 13 020 

AGGTAATAGA AGTATGATTT ATACAGTCAC ACTCAATCCA TCCATTGACT ATATCGTTCG 1308 0 

TTTGGACCAA GTCAAAGTTG GTAGTGTCAA TCGTATGGAC AGTGATGATA AGTTTGCTGG 13140 

TGGGAAAGGA ATCAATGTCA GCCGTGTCTT GAAACGTTTG AATATACCAA ATAC AG CG AC 132 00 

GGGATTTATC GGTGGCTTTA CTGGTAAATT TATCACAGAT ACTTTAGCAG AGGAAGAAAT 13260 

CGAGACACGT TTTGTCCAGG TGGCAGAAGA TACTCGTATC AATGT T AAAA TCAAAGCAGA 1332 0 

CCAAGAAACA GAAATCAACG GAACGGGTCC AACTGTTGAA TCGGTTCAGC TAGAAGAATT 13 3 80 

GAAAGCTATT TTATCTAGTC TGACAGCAGA AGATACAGTT GTCTTTGCAG GTTCAAGTGC 13440 

T AAAAAT CT A GGCAATGTTA TCTATAAGGA TTTGATTTCC TTGACGCGCC AGACTGGTGC 13 500 

GCAAGTGGTC TGTGACTTTG AAGGACAGAC CTTAATTGAT AGTTTGGACT ACCAGCCTCT 13 560 

TCTTGTAAAA CCAAACAATC ATGAACTTGG AGCGATTTTT GGGGTTAAAC TCGAAAGTTT 13 620 
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AGATGAAATT GAGAAATACG CTCGTGAGTT ACTGGCTAAG GGTGCTCAAA AT GTT ATT AT 13 680 

CTCTATGGCT GGTGATGGTG CCCTTCTTGT CACATCTGAG GGAGCTTACT TCGCTAAACC 13 74 0 

AATCAAAGGA ACAGTCAAAA ATTCAGTTGG AGCTGGTGAT TCTATGGTTG CTGGATTCAC 13800 

AGGTGAATTT GTCAAATCAA AAGACGTAGT AGAAGCCTTC AAATGGGGAG TGGCTTGCGG 13 8 60 

AACGGCAACT ACCTTCTCAG ATGACTTGGC AACGGCGGAA TTTATTAAAG AAACATATGG 13 920 

AAAAGTT G AG GTAGAAAAAC GATGAAAATT CAAGACCTAT TGAGAAAAGA TGTCATGTTG 13 980 

CTAGATTTGC AGGCAACTGA AAAAACAGCT GTCATCGACG AGATGATTAA AAATTTGACA 14 040 

GACCACGGTT ATGTAACAGA TTTTGAAACA TTTAAAGAAG GAATTTTGGC GCGTGAAGCT 14100 

TTGACTTCTA CTGGTTTGGG TGATGGAATC GCAATGCCTC ACAGCAAAAA CGCTGCTGTC 14160 

AAAGAAGCGA CAGTTCTATT TGCTAAGTCA AATAAGGGTG TTGACTACGA GAGCTTGGAT 1422 0 

GGACAAGCAA CTGACCTCTT CTTCATGATT GCAGCTCCAG AAGGTGCCAA TGATACTCAC 142 80 

TTGGCAGCCT TGGCAGAATT GTCTCAATAC TTGATGAAAG ACGGTTTTGC AG AC AAAC T T 1434 0 

CGTCAAGCAA CATCTGCAGA CCAAGTTATC GAACTTTTTG ACCAAGCTTC AGAAAAAACT 14400 

GAGGAACTTG TTCAAGCACC TGCTAATGAC TCTGGTGACT TTATCGTAGC TGTTACAGCT 144 60 

TGTACAACAG GTATTGCCCA CACTTACATG GCCCAAGAAG CCCTTCAAAA AGTAGCTGCT 14 520 

GAAATGGGGG TTGGTATCAA GGTCGAAACC AACGGTGCTA GCGGTGTTGG AAATCAACTA 14 580 

ACTGCAGAAG ATATCCGTAA GGCTAAAGCT AT TAT CAT T G CAGCAGACAA GGCCGTTGAA 14 64 0 

ATGGATCGAT TTGATGGAAA ACCATTGATC AATCGTCCAG TTGCTGACGG TATCCGTAAG 14700 

ACAGAAGAGC TAATTAACTT GGCTCTTTCA GGAGATACTG AAGTCTACCG TGCCGCTAAT 147 6 0 

GGTGCCAAAG CTGCAACAGC CTCTAACGAA AAACAAAGCC TTGGTGGTGC CTTGTACAAA 14820 

CACTTGATGA GTGGTGTATC TCAAATGTTA CCATTCGTTA TCGGTGGTGG TATCATGATT 14880 

GCCCTTGCCT TCTTGATTGA CGGTGCTTTG GGTGTTCCAA ATGAAAACCT TGGCAATCTT 14 94 0 

GGTTCTTACC ATGAGTTAGC TTCTATGTTC ATGAAAATTG GTGGAGCTGC CTTTGGTTTG 15000 

ATGCTTCCAG TCTTTGCGGG TTATGTTGCC TACTCTATTG CTGAAAAACC GGGTTTGGTA 15060 

GCAGGTTTCG TGGCTGGTGC TATTGCCAAA GAAGGTTTTG CCTTTGGTAA AATTCCTTAT 1512 0 

GCCGCAGGTG GTGAAGCAAC TTCAACTCTT GCAGGTGTCT CATCTGGTTT CCTAGGTGCC 15180 

CTTGTTGGTG GATTTATCGC AGGTGCCTTG GTTCTTGCCA TCAAGAAATA CGTTAAAGTT 15240 

CCTCGTTCAC TCGAAGGTGC TAAATCAATC CTTCTATTGC CACTTCTTGG AACAATCTTG 15300 

AC AGG AT TTG TTATGCTAGC TGTGAAT AT C CCAATGGCTG CAATCAACAC TGCTATGAAT 153 60 
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GACTTCCTAG GCGGTCTTGG AGGAGGTTCA GCTGTCCTTC TTGGTATCGT CCTTGGTGGA 15420 

ATGATGGCTG TTGACATGGG TGGACCAGTT AATAAAGCAG CTTATGTCTT TGGTACAGGT 15480 

ACGCTTGCAG CAACTGTTTC TTCAGGTGGT TCTGTAGCCA TGGCAGCAGT TATGGCTGGA 1554 0 

GGAATGGTGC CACCACTTGC AATCTTTGTC GCAACTCTTC TTTTCAAAGA TAAATTTACT 15600 

AAGGAAGAAC GTAACTCTGG TTTGACAAAC ATCATCATGG GCTTGTCATT TATCACTGAG 156 60 

GGAGCGATTC CATTTGGTGC CGCTGACCCA GCTCGTGCGA TTCCAAGCTT CATCCTTGGT 15720 

TCAGCAGTAG CAGGTGGACT CGTTGGTCTT ACTGGT AT C A AACTCATGGC GCCACACGGA 15780 

GGAATCTTCG TTATCGCCCT TACTTCAAAT GCTCTCCTTT ACCTCGTTTC TGTCTTGGTA 15840 

GGAGCAATCG TAAGTGGTGT GGTTTATGGT TACCTACGCA AACCACAAGC ATAAAAAATA 159 00 

GAAAAATGAA AAG AT TGG AC CGTTTGGTGC AGTCTTTTTC TCTTCCCGAA ATGCCTGTGA 15 9 60 

AATATGGTAT AATAGAAGAA TGGCAAACAA GAATACAAGT ACAACAAGAC GGAGACCGTC 1602 0 

TAAAGCAGAA CTGGAAAGAA AAGAAGCGAT TCAACGAATG TTGATTTCGT TAGGAATTGC 16080 

GATTTTATTG ATTTTCGCAG CCTTCAAATT AGGGGCTGCA GGTATAACCC TTTATAATTT 16140 

AATTCGCTTG CTAGTGGGTA GCCTAGCTTA TCTGGCGATA TTCGGCCTAT TAATCTATCT 162 00 

CTTCTTTTTC AAGTGGATAC GAAAACAGGA AGGACTCTTA TCTGGCTTTT T C AC CAT AT T 16260 

TGCTGGCTTA CTCTTGATTT TTGAGGCCTA CTTGGTTTGG AAATATGGTT TGGACAAGTC 1632 0 

CGTTCTAAAA GGGACCATGG CTCAGGTTGT G AC AG AT C TG ACTGGTTTTC GAACGACTAG 163 80 

CTTTGCTGGA GGGGGCTTGA TCGGGGTCGC T CT T TAT AT T CCAACAGCCT TTCTCTTTTC 1644 0 

AAATATCGGA ACTTACTTTA TTGGTTCTAT CTTGATTTTA GTGGGTTCTC TCCTAGTCAG 16500 

CCCTTGGTCT GTTTACGATA TTGCTGAATT TTTCAGTAGA GGCTTTGCCA AATGGTGGGA 165 60 

AGGGCACGAG CGTCGAAAAG AGGAACGCTT TGTCAAACAA GAAGAAAAAG CTCGCCAAAA 16620 

GGCTGAGAAA GAGGCTAGAT TAGAACAAGA AGAGACTGAA AAAGCCTTAC TCGATTTGCC 16 6 80 

TCCTGTTGAT ATGGAAACGG GTGAAATTCT GACAGAGGAA GCTGTTCAAA ATCTTCCACC 16740 

TATTCCAGAA GAAAAGTGGG TGGAACCAGA AATCATCCTG CCTCAAGCTG AACTTAAATT 16800 

CCCTGAACAG GAAGATGACT CAGATGACGA AGATGTTCAG GTCGATTTTT CAGCCAAAGA 16860 

AGCCCTTGAA TACAAACTTC CAAGCTTACA ACTCTTTGCA C C AG AT AAAC CAAAAGATCA 16920 

GTCTAAAGAG AAGAAAATTG TCAGAGAAAA TATCAAAATC TT AG AAG C AA CCTTTGCTAG 16980 

CTTTGGTATT AAGGTAACAG TTGAACGGGC CGAAATTGGG CCATCAGTGA CCAAGTATGA 17040 

AGTCAAGCCG GCTGTTGGTG TAAGGGTCAA CCGCATTTCC AATCTATCAG ATGACCTCGC 17100 

TCTAGCCTTG GCTGCCAAAG ATGTCCGGAT TGAAGCACCA ATCCCTGGGA AATCCCTAAT 17160 
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CGGAATTGAA GTGCCCAACT CCGATATTGC CACTGTATCT TTCCGAGAAC TATGGGAACA 17220 

ATCGCAAACG AAAGCAGAAA ATTTCTTGGA AATTCCTTTA GGGAAGGCTG TTAATGGAAC 172 80 

CGCAAGAGCT TTTGACCTTT CTAAAATGCC CCACTTGCTA GTTGCAGGTT CAACGGGTTC 17340 

AGGGAAGTCA GTAGCAGTTA ACGGCATTAT TGCTAGCATT CTCATGAAGG CGAGACCAGA 17400 

TCAAGTTAAA TTTATGATGG TCGATCCCAA GATGGTTGAG TTATCTGTTT ACAATGATAT 174 60 

TCCCCACCTC TTGATTCCAG TCGTGACCAA TCCACGCAAA GCCAGCAAGG CTCTGCAAAA 17 52 0 

GGTTGTGGAT GAAATGGAAA ACCGTTATGA ACTCTTTGCC AAGGTGGGAG TTCGGAATAT 17580 

TGCAGGTTTT AATGCCAAGG TAGAAGAGTT CAATTCCCAG TCTGAGTACA AGCAAATTCC 17 640 

GCTACCATTC ATTGTCGTGA TTGTGGATGA GTTGGCTGAC CTCATGATGG TGGCCAGCAA 17 700 

GGAAGTGGAA G ATGCT AT C A TCCGTCTTGG GCAGAAGGCG CGTGCTGCAG GTATCCACAT 177 60 

GATTCTTGCA ACTCAGCGTC CATCTGTTGA TGTCATCTCT GGTTTGATTA AGGCCAATGT 17 82 0 

TCCATCTCGT GTAGCATTTG CGGTTTCATC AGGAACAGAC TCCCGTACGA TTTTGGATGA 17 880 

AAATGGAGCA GAAAAACTTC TTGGTCGAGG AGACATGCTC TTTAAACCGA TTGATGAAAA 17 94 0 

TCATCCAGTT CGTCTCCAAG GCTCCTTTAT CTCGGATGAC GATGTTGAGC GCATTGTGAA 18000 

CTTCATCAAG ACTCAGGCAG ATGCAGACTA CGATGAGAGT TTTGATCCAG GTGAGGTTTC 18 060 

TGAAAATGAA GGAGAATTTT CGGATGGAGA TGCTGGTGGT GATCCGCTTT TTGAAGAAGC 1812 0 

TAAGTCTTTG GTTATCGAAA CACAGAAAGC CAGTGCGTCT ATGATTCAGC GTCGTTTATC 18180 

AGTTGGATTT AACCGTG CGA CCCGTCTCAT GGAAGAACTG GAG AT AG C AG GTGTCATCGG 18240 

TCCAGCTGAA GGTACCAAAC CTCGAAAAGT GTTACAACAA TAAAAAAATA GCTTCTTTCC 18300 

AAGTTTGGAG GGAAGCTATT TTAGTGGCTA TTGATTGCTT TTATTTTCTG AAGTTGGCGC 183 60 

ATTGGACTGT TTTTCGTTTT CAGTAGCAGG TTTACTTGAA GCAGGAGTAG AAGAGTCCTG 1842 0 

AGTTGCTGTT TTCTGATCTT CTTTTTTCTC TTCCTTGACG CTAGATTTTG GTGTTTCCTC 18480 

TTGCTGTGTT TTTTCTTGAC TAGTGTTAGT CTCTTTAGTT GGACTGGTGT TTTCCTTAGG 18 540 

GGATTCCTTT TGGATTTCTT TGACAATGGT TGTCGTCTGG CTTGTCGTAG GTTCTTTTTT 18 600 

AATATTTTTG TTATTATCCA AGGCGTT 18 627 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2560 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 114: 

TAAAATACGT TACCTTGCTT CTGCACGTTC AGCAGGTAAG TCATTGAAAT TTAAAGATCA 60 

AGATATTACA ATTGAAGAAA CGACTGAAAC AGCTTTTGAA GGAGTTGATA TTGCTCTCTT 120 

TTCAGCAGGT AGTTCTACAT CAGCTAAGTA TGCACCATAC GCAGTAAAAG CTGGCGTGGT 180 

AGTAGTAGAT AATACATCTT ATTTCCGTCA AAATCCAGAT GTTCCTTTGG TTGTTCCAGA 240 

GGTCAATGCT CATGCACTTG ATGCTCACAA CGGAATCATT GCCTGCCCTA ATTGTTCAAC 3 00 

AATTCAAATG ATGGTGGCTC TTGAGCCGGT TCGCCAAAAA TGGGGCTTGG ACCGTATCAT 3 60 

TGTTTCAACT TATCAAGCCG TTTCAGGTGC TGGTATGGGA GCAATTCTTG AGACACAACG 42 0 

TGAACTTCGT GAAGTCTTGA ATGATGGTGT GAAACCACGT GATTTGCATG CGGAAATCTT 4 80 

GCCTTCAGGT GGTGACAAGA AACATTATCC TATCGCCTTT AACGCTCTTC CACAAATTGA 540 

TGTTTTCACT GATAATGATT ACACGTACGA AG AG AT G AAG ATGACCAAGG AAACTAAGAA 600 

AATTATGGAA GATGATAGCA TTGCAGTATC TGCAACATGT GTGCGTATTC CAGTCTTGTC 660 

AGCTCACTCT GAGTCTGTTT ATATCGAAAC AAAAGAAGTG GCTCCAATCG AAGAAGTAAA 72 0 

AGCAGCTATC GCAGCCTTCC CAGGTGCTGT TCTTGAAGAT GATGTAGCTC ATCAAATCTA 7 80 

TCCTCAAGCT AT C AATG C AG TTGGTTCGCG TGATACCTTT GTTGGTCGTA TCCGTAAAGA 84 0 

CTTGGATGCA GAAAAAGGAA TTCACATGTG GGTTGTTTCA GATAACCTTC TCAAAGGTGC 9 00 

TGCTTGGAAC T C AGTTC AG A TTGCTGAAAC TCTTCATGAA CGTGGATTGG TTCGTCCAAC 9 60 

AGCCGAATTG AAATTTGAAT TAAAATAGTC ATATCGTTTA GG AGTTC AG A TGAACTCCTT 102 0 

CTTTGAAATA GAGAGGTGTT TTCGTGTCTT AT C AAG AT T T AAAAAAATGT AAAAT C ATT A 10 80 

CAGCCTTTAT TACCCCCTTC CATGAGGATG GT T C C ATT AA CTTTGATGCT ATTCCAGCCT 114 0 

TGATTGAGCA TTTATTGGCC CATCATACGG ATGGAATTCT TCTCGCAGGA ACGACTGCTG 12 00 

AGAGTCCAAC TTTGACCCAC GATGAGGAGT TGGAGTTGTT TGCGGCTGTA CAAAAGGTTG 12 60 

TCAATGGACG CGTTCCTTTG ATTGCGGGTG TAGGTACTAA TGATACGCGT GACTCTATTG 132 0 

AGTTTGTCAA AGAAGTAGCG GAATTTGGTG GTTTCGCAGC TGGGCTTGCT ATTGTTCCTT 13 80 

AC T AC AAC AA ACCTTCTCAA GAAGGGATGT ATCAGCACTT T AAG ACT ATT GCAGATGCTT 1440 

CTGACCTACC AATT ATT AT C TATAACATTC CAGGGCGTGT AGTTGTCGAA TTGACTCCAG 1500 

AAACCATGCT TCGCTTGGCT G AC C AT CC AA ATATTATCGG TGTCAAAGAA TGTACTAGCT 15 60 

TGGCTAATAT GGCTTACTTG ATTGAGCACA AGCCTGAAGA GTTCTTGATT TATACAGGTG 1620 

AGGATGGAGA TGCTTTCCAT GCCATGAACC TTGGGGCGGA TGGGGTTATT TCTGTTGCCT 16 8 0 
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C T CAT AC AAA TGGGGATGAA ATGCACGAGA TGTTTACTGC GATTGCAGAA AGCGATATGA 174 0 

AGAAAGCCGC AGCAATTCAG CGTAAATTCA TTCCTAAGGT TAATGCTCTC TTCTCTTATC 1800 

CAAGTCCTGC TCCAGTTAAG GCAATTCTTA ACTATATGGG ATTTGAAGCT GGACCCACTC 18 60 

GTCTACCTCT TGTTCCAGCA CCAGAAGAAG ATGCCAAACG CATTATCAAG GTTGTCGTAG 192 0 

AT GG CG ACT A CGAAGCAACT AAGGCAACTG TAACAGGGGT CTTAAGACCA GATTACTAAT 1980 

AAAGACAATA AAATCCGGCT CTTTGTCAAC TGTAGTGGGT TGAAGTCAGC TAAGCTCGAG 204 0 

AAAGGACAAA TTTTGTCCTT TCTTTTTTGA TATTCAGAGC G AT AAAAAT C CGTTTTTTGA 2100 

AGTTTTCAAA GTTCCGAAAA CCAAAGGCAT TGCGCTTGAT AAGTTTGATG AG ATT ATT GG 2160 

TCGCTTCCAA TTTGGCGTTT GAATAGGGTA GTTGAAGGGT GTTGACGATT TTCTTTTTGT 2220 

CCTTTAGAAA GGTTTTAAAG ACAGTCTGAA AAATAGGATG AACCTGCTTC AGATTGTCCT 2 2 80 

CAATGAGTCC G AAAAATT TC TCCGGTTCCT TATTCTGAAA GTGAAACAGC AAGAGTTGAT 2 34 0 

AGAGCTGATA GTGATGTTTC AAGTTTTGTG AATAGCTCAA AAGCTTGTTT AAAATCTCTT 2400 

TATTGGTTAA GTGCATACGA AAAGTAGGAC GATAAAATCG CTTATCACTC AGTTTACGGC 2460 

TATCCTGTTG AATGAGTTTC CAGTAGCGCT TG AT AG C C TT GTATTCGGGA TTTTCGATGA 2 52 0 

AACTGATTCA TGATTTGGAC ACGCACACGA CTCATAGCAC 2 5 60 
(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 115: 

TATTGGATTT CCCTTGCAAT CAGTTTATGG GACAAGCACC CGGCAGCGCA GAGGAAATCA 60 

ACGCCTTCTG TAGCCTACAT TTTCAAACCA CCTTCCCACG TTTTGCCAAG ATTAAGGTCA 12 0 

ACGGTAAGGA AGCAGACCCT CTCTATGTCT GGTTACAAGA CCAGAAATCC GGCCCACTAG 18 0 

GAAAACGAGT CGAATGGAAT TTCGCTAAGT TTCTCATCGG TCGAGATGGG CAAGTCTTTG 240 

AACGCTTTTC TTCAAAAACA GACCCAAAAC AAATTGAAGA GGCGATACAA ACTCTACTAT 3 00 

AATTCACAAT CTCACTATGA TTAGGTTTCC TTTAACCTGA TGAATAGTGA GATTTTTTGA 3 60 

TGGGCTTTGA CTTAAATAGA AAAACACCCC ATGATATGAA ACATGAAGTG TTGTAAAGTC 42 0 

TATGTTGTAG GTGCTTATTT CACAATTTCA ATGTGACCAG TG AT AAC G AA T AC CAT AC AG 4 80 
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AATCTTCATA TACACTAAAC AAATG ACT T T CTAATTATTT CAATTAGTTT TGGCTAGTAA 540 

ATATCATTTC CAACAAACGC CCTCTCAATT CCTTATCCTG ATGATGCAAG ATATTCATTA 6 00 

AGTCATGAGA GTTTTTCGCA TTGATGAATT GATTTAACAA TCTATCTTTT AATTCATATG 660 

GAAGAGAAGC TGTCTTTAGT AGTCTAAAAA CTTCGTCATT TAAAGATGTC CTTTTATTAT 72 0 

CTTTCCATTC AAATTT AG C T GTATCATTCT TATTTGGCAA TTCAATTATA GACACATTCG 780 

TTCCTTTAAA ATGAATTCTA TGTTTTCTAT TGCTTGGAAC GATACTAGAA TCTCCTTGTA 84 0 

ATGCTAACTC TACCATTCCC ATTTCCCAAT CGATTGATAA TCTTGTTTTA TATCTTTGAC 90 0 

CATTTTGATC TTCAAGCATT TCAAAAGAAT GTTGTTTTCC TGGGAATACA TACCAATCTA 960 

CAACTTCAGG TAAATCAACA CCCATACCTA TCTCAGAACC AACCAAGGGA ATGATTGCAC 1020 

CACTTTTTGC AAACACAGGC GTAGTCGAGA TGTCCCTATA AACACTTAAC TTCACACCAC 108 0 

CTGTGTATTT TTTCTCTGAA AAGAAGTCAT ACCATTCACC TTCAGGGAAC CATACATCTA 1140 

CTTTTGCAGA TTGGAATGTC AAATCCATCT TTTCTACAAT GGGAGCCACC ATCAGTTCTG 12 00 

TTCCAAAAAA GTATTGGTTT GGAACATTAT AGCTCTCATC ATTCTCTGGA TAGAAATAAT 1260 

AGATTGGACT GATTAATGGG GCACCTTCCT CATGTGTCTG TACATTCATG GTATATAGAT 1320 

AGGGAATCAT CTGATGTCTC AAACGAAGGT ATTTCTTCAT AATCTTAGAT GTTGTTTCTG 13 8 0 

AAAAAAACCA AGGTTCTTTA C T ATT AAAAG GACTTCTAGA ACTATGTAAT CGAGTAATCG 1440 

GACTAAAAAC ACCAAACTGT AGCCATCTAG TTTGTAGCTC TTCGTCATAA TCCCCCAACA 1500 

TATGTCCACC GATATCATGA CTCCACCAAC TATAACCGAT ATTAGATGCT GTCGCTGTAA 15 60 

AATAGGGTTG AAATCTTAAG GAATTCCAAC TAATAATAGT ATCCCCTGAA AAACCAACAG 162 0 

GGTAGCGGTG ACTACCAGGA CCTGCATATC TTGATAAAAT CAAACCACCT TCTGCATTTT 1680 

TACAACTATC CTGATAGTGA TAATGGTTTA AAAGCCAAAG TGGATCTAGC ATACCTTGTG 1740 

TCCCTTGTTG CCAGTCAATC CACCAAAAAT CTACTCCCTG CTTTTCTAGT TCATAATGAA 1800 

CATCTTTAAA GTAGGCTTCC CTAAAAGAGG GATTAAAAAA AT C AAAAAT A GCAGGTTCTT 18 60 

CTAGTTCTAC ATTTAACCCC AACCGTTTTG CGATTTGAGG ATAAGCTTCT TCATAAGCCC 192 0 

GTATCCCATC AGCAGGATGG ACATTTAAGG AGAGTTTTAG CTTTCTATCA TGAAGTTGTT 198 0 

GCAATAACTG TTCTGGATTT GGTATTAAGT TTCTATTCCA AC T AT AT CC T GTCCAGCCAC 204 0 

TTCCAAAGCG AGCTGGAATG TCAGTTATAT G CC AATCC AT AT C T AAC AC A CCGATAGATA 2100 

ATGGAATTTT CTCTGTTTCA AATCTGTCTA TTAAATCCAA GTATTCATCC GACGTATAAG 2160 

GCCAATATCT ACTCCACCAA TTGCCTAAAG CATATCTTGG CAACAAGGGT GTTGAACCAG 2 220 

T CAAATGGT A AAAATCTCTG ATTGCTCCTC TATAATCATG CCCATAGGCA AAGAAATACA 2 2 80 
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GGTCAATTTG ATTTTCTCTC TCAATATAAC CAGATTGTTC ATCCCAAATA AATCCTTGAG 2 340 

AATCATCCAA TAAGGCTATA CCATTTCGGC TAATAATTCC ATCTTCTAAC GAGATTGCTC 2400 

CATCTGCCTT ATCCAGAGTC CGAGCTGTTC CTTTTAACGT TTCAATAGAT TC AC CAAAAT 2460 

ACCAGCGACT ACCATATACG GCAAAATTTC CTTTTAATTC TATAAATAAA TTTTCGGCGT 252 0 

TAAATTCTCC TTTATTAAAG TGCAGATGAA AATAGTCCGT CATAATATCT AGTACGTTTG 2580 

ATGTCTCGAT ATAATCTAAC GAAATTTGGC CAAAATCTCT ATTATAGATA AGTTGTGTCG 2 640 

TTCTATCCTC AAAACTTCCA GTTTGAGAGT ATTCTAACCT TACTAGCTTG TCTGTTAATA 27 00 

C AG AG ATT CG ATAAAACTCT CCCTTAAAAA TTTTCAATTT GTTTTCCTCC TTTTATGGTA 2760 

GCATAAAAAC AGAACGCACC ATTTTTGATG CGTTTTTCAT TATTCTGAAT GCAATGTTCT 2 82 0 

ATCTGTTATA TCTATGACAA ATAATAGTCA ATTGAAAAAA TGCAGTGGAC AAAATATCTT 2 8 80 

TTAAGAAACC AAGAGTTTAT TAAAGAGTTA TCACTTTTCA ACTTTTCTAA GCTTATGCAG 2 940 

TTGTGAAACA AACTACTTTT AAACTATTAA CTAAGATAGG ATTGATAAAT AATTTCAAAC 3 000 

TCTTACTAGC AATCATACGA TATTCAAGCT CACGTGCTTT TTTCCTTCCT GCTTATTTCT 3 0 60 

TAGAACTGAA GAACCCGGAT CGGTATATAA ATTATCCGGA TCAACATAGT CATAAGATTC 312 0 

ATAACAGTTG CGCTTCATTA AGTCATCCCC AGAGCAAGAG CTTCATCTCG TAATTTTTCA 3180 

AC AT C ACT AA CCGTAGGTCG CCATCCTTCA ATCATATTTG TACTTAAAGC ATACCAAACA 32 40 

CTCTTAAAAA CGGATCGGTT TTCAAAAGCT ATTCCCATGA TTGTCATCTT TTCTTTATCT 33 00 

ATATCTAAGG ACATATGCTA CCTCCTTTAG ATACATTATA CCATGTTTCT CTGTAGCTTT 3 3 60 

TAAAAATTTT ATTTTGTTTG TCATATCTAA GTTTTCAGCA CGCTTATCCT ATTTTATAAG 342 0 

CCTCAAACCC AAATATAAAA CGCATTCTTT TTGCTTTTTT ACTATTGTAT CGTATTCTAC 3 4 80 

GATAACATAC TTTACTTTAT TGTTTTTTTA AATAACAGCA GTTCCCTGTT TATCAACTAT 3 540 

TCGAACTACT TTCTATTTTG CTTCATACCC TACATAGCGA AAAAATATGA AAAAGCAGAG 3600 

AAGAATATCT TAAAAAGACC TCTTCACTGC TAATATTAAC ACTCATTATT TAAACTATAT 3 660 

GGATTCTATC ATCGAGTATA CTTTTTTACT TATTAGATAC CTTGCTCTTC TTTCACCAAT 372 0 

TTTTGATCAT ATACACGGAT GAATGGAAGA TAGACTAGGA ATGCTGCAAA TG CACAT ACT 37 80 

AGAGCAACTA ATACAGCTCG AAGATCTGCT GTCCCTAAGA AAGCTCCAAT CCCTACTGGA 3840 

GTTGGCCATG GAACCTGTGC GATAATTGGC TTAATAAAGT T T AG AG AATT CGCTACGTAA 3 900 

TAAATAGTAG CAGTAACCAT TGGTGCTAAA ATAAATGGTA TAGCCAAGGC TGGATTATAG 3960 

ATAATAGGTA ATCCAAAAAT TAATGGTTCA TTAATATTAA ATAAGGCTGG AACTACAGAT 402 0 
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GCTCGTCCTA TTGCTTTAAG CTGTTCAGAT TTAGAGGCAA AAGCAATATA TAAACATAGT 4080 

CCTAAAGTTG CACCAGAACC ACCTGCAATT ACAAACATAT T AG AAAATT C ACCTGCAACA 4140 

GCGAAGTGCC CGCCAGCAGC ATTTTCAGCC ATGTTAGCAA GAGCAATTGG ACTAACAAAT 4200 

GCAAAAACAA TGTTCGCACC GT GG AT AC CT ACAATCCAAA GTAGTTGAGT CAATAGATAA 42 60 

ATAATCATTA AACCAATCCA CGAATTAGTC AGATTGGATA CAAAACCAAA TGGAATTGCA 432 0 

ATGACTTTAA AAATATCTGT TCCCATTGCT ACAAGAAGAC C GT TG AT AAA GATAACAACA 4380 

AATGCAACAA CAAATCCCGG AACCAAAGCG GTAAATCCAC GAGAAACTCC TTCTGGAACA 4440 

GCTTCAGGCA TTTTAATAAC CCAATTATGT TTAACACACA TACGATAAAT AAGAACAGTC 4500 

ACAATTGCCA TAATGATTGC GGT AAAAAT C CCTGTTGTCC CAAAACGTGC GACTACATTT 45 60 

CCCATTGCCC ATCCATCTGC AATTACTGCA CCTTCTTTTA GACTTGTCAC AGTCTTCATC 4 62 0 

ATTCCACCAT CAAAAATGAT TTGCGGTACT GTCATGACAA AAGCCATCAA GGCAAGCAAG 4680 

GCACCATTAA GAGGATTCAT ATTGAGTTCT TCTTCCTCTG CATAAATTTT TGTCAATTCA 4740 

TATGCAAGTG ATAGAACGAA ATAAAGAGAT AGAGAACCCA TAGTCGCATA GTTTGCAACC 4 800 

ATGTAAAGTG ATGTGAATTT ATCAAATGAA GCAGAGAAAA TATCTGCCAC AATTGGCCAA 4 8 60 

AATGAGAAAG CTTGTGGCAA AATACTGAAT ACCAAAAACA TTGATCCTAC AATAGTAAAT 4 920 

GGTACAGCAG CCATACCTGC AGCCGTGATA GCACGTACTA CTTTAAACTG AGCAAGTTTG 49 80 

CCCATTGGTC C C AT AAC AT G GTTTTCAAGA AAACCAAACA ACCCGTTTTG TTG AT CC AT A 504 0 

AATAGACCTC CTTAATAAAA CATAATAATT TTTACTTTCT AAAGACTAGT TTCAAATACA 5100 

AATTATACTA GATCAGGATT ATAAACTAAG TGAGTTCTTT TCCAATTGGA CAAATTGTTG 5160 

ATAAGCCTTA TCTGTTCGTT TATAAATTTT TTTAATTCTT CTAATGTCTA ACAAACTCAG 5220 

AACTAAACCT AATAGAAGAA CTACAAAAAC AAATAAACGT GCTACTTGGT TATTTTCAAA 52 80 

AATCGGAAAA AGATTCTTAA ACCAACTTGT CCAAGTTAAA ACAAGTAATC CTATTGAAAT 534 0 

AAGCATTTGT ATTCTAACAA ACATTAGTGT TATTCCCAAC TTTTCTTTCC TATTTCCATA 5400 

AAGTTTAAAT TGTTCAACAG TTGCTAAAAT AGAAAATACT ATGAGCATAA TGGGGAAAAT 54 60 

AATAATAGGC GAGGGACTAA T AAACTG AC T CAAAAGCCAA TAAATATTCC CAAAAAAGAA 552 0 

GAGTGCTATT GAATAACGTA GAAGAAGATA T CG ATTG AAA AAAGTATTAG TTAGAGCCAT 55 80 

CTCTCGACGT TGTTGTTCAA TCTTTTGTCG TTCTTTTTTA TCCATATCAT TTCCTCCTTA 5640 

TATAACAACA CATATTTAGT TAACTTTCTT ATAAAGAGCT AACATTTCCT TTGCTACTTC 5700 

TAATAATGTC ATAGTGGTCA TTAAATGATC TTGAGCATGT ACCATGATAA TTTCAATTTT 57 60 

AATTTCCACT CCACTTGCGT ATTCTTGCAA GAGTTTGGTT TGTGCATGAT GCGCTTCAAG 582 0 
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AATTATCTCA TTTGATTGAT TTAATTTACT TTCTGCATCA TCAAAACTAC CTTCTCTCAT 588 0 

TTTTGCAAAT GCTTCATGTA TTTCTGACCT TGCATTTCCC GAATGCAGGA TAATTTCAAA 5940 

TGCTGCAACC TGCAGTTCCT CTTGATTCAT ATAAACCTCC TATTTTATCT TCTCAAATAT 6000 

GTTAATAAAA TCTTCAAAGT TATTGCAAGA TATTAGCTGA TTTTGCAATT CATC ATT CTC 60 60 

TGT C AG AG AG ACTATCTTTT TAGTCACAGT TGCCAAACCT TCGTTCCCAT ATATTGATGG 612 0 

AGATAGAAGA AATACTAGCT GGACATGTGA ACTTTGATTA TCCCAGAGTA ACGAATCTTT 6180 

ACAAATTGCA ACCGAAACCT TTCCCTCTGT ACCAAAGGGC TGAATAGGAT GCGGAACTGC 6240 

AATTTTTTCA GAAAAAACAA CTGAACTTAA TTCTTCGCGC TGTTTAATTC CATAAAGTAA 6300 

AGATTGTTCA AACTCATTTG ATTCACCAAC AGATAAACTC TCAACCATCT TTTCAAGTAA 63 60 

ATTTACCTTG TCTGATTCAG TACATATTAA AAAGTTTTCT TTACTAAAAT AC TGT CT AAA 642 0 

GCCGTTGTTT TCAAATTTGT TAATCTTTGA TGATTGTACA T AAC TAG AAA CTTGCATCTA 64 80 

ATCCATAGCT TTTCTAATCA TTTCCATCTC ATCACTCTTA AGAAACACAC TAACTTTAAA 6540 

AACTGGGATT TGAAAATATA GATTTGATAA ATCAATAGCT GACACTATAA AATCTATTCC 6600 

TTTAAGTTTT TCTTGATTCA ATTCATAGTA GCCTATTACA TCAACAACTT CTACTCGCTT 6660 

CCCAAACTCC GTTTCCAAAC GATTTCTTAA CATTTGGGCT GC AC C AAATC CTGTTGCACA 672 0 

AATAGCAAGA ATATTAAACT TAGTACTCTC TTTGCTACGT TCCATAGCAG CTAAAAAGTG 6780 

AAGACTTACA TATGCTACTT CATCATCTGA TATTGTCCAC TCCAAGAACT TGTCCATATT 6840 

TGCAAGAATT TCTCTAGTCA TAAAGAATAT ATCACTATAA TTCTGTTTAA TTTCATCTAC 6900 

CAAAGGGTTA TTTAAGGTAA TCCGGCTTTC T AAACGT AC T TGT AGTGT C A TTAGATGAGT 69 60 

TATCAATCCT TCAATTAGTT GGAAATCTGA AGAAAAGTTA TACATATCAT CTAATCCTAA 7020 

ATTCTGAAAT GTTTTAAATA AAGATTTTTT TAAAACTTCT T C AGAAAT AT TCTTCTGATT 7080 

TTTTTGACAT TGTTGACTCT TAGCTAACAA ATGCAAAGTA ATGTAGTCTA TTTCCTGAAC 714 0 

TGGAAATTCC TGATTTGTTA CTTCTCTTAC TTTAGAAAGA ATTCTTTGGG CAACCTTTCT 72 00 

CTCTATTGCA TCATCAGTCA TCTGACAGTC TAT ATT T T TT ATTTCAAATC CGGATTTTAA 7260 

ACGAATCACA GACAATGCTA TGTGAACTAC TAAATTCTGT AGTACAAAAT CAGATAGTTT 732 0 

TAGGTTGGCC TCTTGGCATT CATCCAAAAC AATTCTAGCA AATTCTTCTA ATGGAACAGT 7380 

TTGATCAAAA AAGTTAAATT TTACATAGCA ATGTATTGTT TTAAAAAATT GATTCTCTAG 7440 

GAAATAATTT ATGATAAAAC GTCGTTTATC ACGTTCCTCG CCTGAGACAT AAACTCCTTT 7500 

ATTCGCCCTA CTCTCAATGG ACAAATTATA CTCTGATAAC ATCACTCGTA TCTTTCTGAA 7 5 60 
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ATCATGAGAT AATGTTGAAC GACTAACGTA AAGTTCATCA GCTAAATCAT CAAAAAGAAC 7 62 0 

TGGAACTTGC TCAAATAATA ATTTATTTAA GATAAATACT AAACGATCAT CACCTTTTGA 7680 

AACCGCAGTT TTCGTATAGT CTTCTTCCAG TTCATAAGTT TGTCTAAACT CCTGGTAAGC 7740 

GCCTTGATTC TCAAAAAATA TTTGATACCC TTGACCTTGT TTTGAAATCA ACCGGACTCC 7 800 

TTGAATAATC ATTGTCTTCT CAATTAATTT CAGTACATTA CGGACAGTTC TATCTGAACA 7860 

GGATAAATAT TCTGCCAGTT CTTTGCTTGT AACAAAACGT TCCTTATTTT TTATTAAAAA 7 920 

TTGAAGGATA TCTTTCTCTT TAATGTTTAA CACATTCATT CCCTCCTAAA ACGTATGTTT 7 980 

TCATATATTG AAGCATATTA TACACTTAAA TCAGTTTATA TCAAACTCAA AACAATTTAT 8040 

CTTAACCTAA ATATTTATTG ACATTTCATG TGTTCATCAA AT ATT CT C AA GAATCAAATT 8100 

AGCCATTTTT TCAATTCCCA TTGGAATAGG AATATAGGCT TGAGGAGGTA TTTGTACAAC 8160 

TGGTTTTCCT GCTTTAGAAC CAGCCTCTTC AAATTGCTTA AAGTACATTT TTGTTTGAGG 8220 

ACTGACAAGA TACAAATCAA AAGCTGCTGC TGCGATAGCT TTCCCTCCTT CAGTAGCACT 82 80 

AATAGCATCA ACTACAATAT CTTTCCCTTT TCCTTTTAGA AACTCTGTTG TTTTCTGTGC 834 0 

CATAAGTGAT GAAGACATTC CTGCTGCACA AAT AAT T AAA GCTTTTGCCA TAATATTTTC 84 00 

TCCTTTTCTT AAATCCAATC AAAGCTGTGC TAAGTTGGCT TATTTGTTAT CTATTTTTAT 8460 

TATAAAATAA AGCGTTTCCA ATGACAATTC CCTCATTTTC CTAAATGATA TGGAAAAAAA 852 0 

TT AT T TAT AC TTCAATTTAT AAAATAAAAT TATTCCTGAG AGTAGAAATG AAACACTATT 858 0 

TGCTAAAATC AAAGGCAAGT CTCCTATACG AAT AC C ATG A GCAAGCCACA ATGCAATACC 8 640 

AATAACTTGC ATAACATACA TACCTAG AG C AATAGATCCT GTGTCCTTTG TCTTAACTAC 8700 

ACGAAAAACT TGTGGTAAAA ATGCAAATGT TGTTAAAATT GCTGCAATAC TTCCAATCAT 87 60 

ATGTCACCTC AAT ATG CT AA ACAAACTGAG AATAATCTCA GTTTGTTTAT ACTATTCTAC 882 0 

TGATTCACCG TTAGATGAAA TAACTTCCTT ATACCAGCCA AAAGATTTTT TCGGGGAACG 8 880 

ATTATAACTT CCCTTCCCAT TATCATCTTT ATCTACATAA ATAAAGC C AT AACGTTTCCG 8940 

CATTTC AC CG GTACCAGCTG AAACCAAATC AAT AC AT CC C CATGGAGTAT AACCCATTAA 9000 

ATCAACACCA TCTTCAACTA CAGCCTTTTT CATTTCACGA ATATGGGCAC C TAG AT ATT C 9060 

AAT TCTAT AA TC AT CATGTA CCATACCATC TGCTGCAACT TGATCTATAG CTCCAAAACC 9120 

ATTTTCAACA ATAAAGAGTG GTAAGTGATA GTGGTCTGTA AACCAATTTA ACGCATAACG 9180 

CAAACCTTCT GGATCAATTT GCCACTCCCA TTCAGAAGCC TTAACATAAT TATTTTTCAC 9240 

TAAATCTTCT GTTTCAAGAT AAT C AAAAT A AGGATTATTT TCACGATGAG AGTCG AT AG C 9300 

AAAGGACATA TAGTAACTGA AACCAATGTA ATCTACAGTC CCACCAAGTA AATCTTCTTT 93 60 
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ATCCTGGGCA GTAAAATCAA CTGAAATACC TTTTCGTTCC CAATACTTGA AAATATGCTC 942 0 

AGGATATTTA CCTAAAACAT GCACATCAGC AAAATAATAA CGCTTCTGCA TAGCTTTCAT 94 80 

TGCCATTAAG ATATCCTTAG GATTGCAAGT AACTGGATAA AT T GG AC AC A T CGC AAT CAT 9540 

ACAACCTATT TGAAAATCTG GATTAATCTC AT G AC C AATT TTTACAGCTC GTGCAGAAGC 9 600 

AACTAATTCG TAATGTGCTG CTTGATACAT AATTGCTTCT CTATTATCAC CTTCCTCATA 96 60 

T AC AAT ACC T GAGTTAGTAA ATGGTGCAAA ATCTTCCTGA TAATTCGCTT GATTATTGAT 972 0 

TTCATTGAAA GTCATCCAAT ATTTAACCTT ATCTTTGTAA CGTTTAAATA CGACTTCTGC 97 8 0 

AAAACGAGCA AAGAAATCAA TCAATTTCCT ATTTTTCCAA CC AC CAT ATT CGGTCACTAA 9840 

GTGATAAGGC ATTTCAAAAT GAGATAGAGT GATGACAGGT TCAATACCAT TCTTTAAGCA 9900 

TTCATCAAAA AGATTATCAT AAAACTGTAA TCCTTCTTCA TTCGGCTCTA ACTCATCACC 9 960 

TTTTGGAAAG ATACGTGTCC ATGCAATAGA GGTACGGAAG CACTTGAATC CCATTTCAGC 1002 0 

AAAAAGTGCT ATATCTTCTT TATAACGGTG AT AAAAAT C T ATCGCCTCAT GATTTGGATA 10080 

ATATTTACCC TCTAAAACTC CCAAAGTAAT TTCACGAGCT ACTCCATGAC GACCAGCAGT 1014 0 

C AT AAC AT C A GCAACACTAA TTCCCTTGCC ACCTTCTTGC CATCCACCTT CAAGTTGATG 102 00 

AGCAGCAACA GCACCACCCC AT AAAAAT C C ATCTTTAAAA GTAGTCATCT TTTTTCCTCC 10260 

TGACTTTGAT ACTCTTATTA TAAACCTTAA ACCAAAAGAT GAAAACGCAT TCTTTTTCCT 10320 

TATTGTTAAG GAAAGAAGTA ATTTTTAATG GAAATAGAAC AATATCTTCT TGTATTCTCG 10380 

TAATGATATC TTTACGATTT TCAATACTTT CAAACTACAA AAACTCTCAC AAT AATT C T A 1044 0 

ATTCCCTGTG TCTATAAACG ACTTATCGCT TTCTGGCATC C C AG AATC AT CTTCTATATA 10 500 

ACGTTCAACT TGCATCTGCA AGTGATATTT TTTTCTTAAA TCTAAGATTT TCTGCATTGT 105 60 

CTTTGATTGA TAATGTTTAT CTAAAGTTTC TTGATTTATC CACTGATCAA TAAGGAGAAT 10 62 0 

AGTTCCCTCT TTTTCAATTG GTAAAAAATA TTCGTATTTC AAGTTACCTT TTTGATTTCT 106 80 

AATTTCTTTA ACAAGGCCAC TATCAAGCAT TTCTCTTGCA AACTTTATTG CACTATCTCC 10740 

ATCACCTTTA TAATATACAT GAATAGTCAA TGTCATCTTA TATCCTCCAA AATCATCCTT 10800 

CAATTTTAAA AAAACAAGTT TAGATGAGGA TCTAAACTTG TTTTTTATGA ACTAATTATC 108 60 

TAACGTTTCG CCATTACTTT CAATCACTTC TTTATACCAA TAAAATGATT TTTTCTTATA 1092 0 

GCGATTTATA GTCAATTGAA ACAAGAGCAG GACAAAAGAG CCTCATAAAA GGTATTGCAA 10980 

CTTGGTAATA CCTTTTTGAG GTGCTTTTTG ATATGAGCCC ATGTTTTCTC AATAGGATTG 11040 

TACTCAGGTG AGTAGGGAGG AAGAGGTAAA AGTTTATACC CAAACTCTTC ACACAAGAGT 11100 
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TCTAGCTTCC CCATTCTATG GAATCTTGCA TTATCCATAA TAATAACCGA TGGTGTGGTT 11160 

AATGTTGGTA AG AGAAACT T CTGAAACCAA GCTTCAAAAA AGTCGCTCGT CATCGTCTCT 112 2 0 

TCGTAAGTCA TTGGAGCGAT TAACTCACCA TTTGTTAGAC CTGCAACCAA AGAAATCCTC 112 8 0 

TGATATCTTC TTCCAGATAC TTT 113 03 
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

CCTTAGATTT CCACTTGCCA GAGGAATTGA TTGCCCAAAC GCCCCTTGAA AAACGTGATG 60 

CCTCCAAACT CCTCATCGTC AACCGTGAGA CAGGAGAAAT G C AAG AT AAA CATTTCCACT 120 

CTATTATTGA TATGCTGGAA CCTGGTGATG CCCTTGTCAT GAACGACACC CGAGTTCTCC 18 0 

CTGCCCGCCT CTATGGTCAA AAAGTGGAGA CAGGAGGTCA TGTGGAACTT CTCCTCCTTA 240 

AGAACACTAG TGGAGACGAG TGGGAAGTTC TGGCTAAACC TGCCAAACGC CTCAAGGTCG 3 00 

GTACTCGTAT CAGCTTTGGT GATGGCCGCC TCAGCGCTGT CGTTACAGAA GAATTGACCC 3 60 

ACGGGGGACG CATTGTCCGC TTTGAATACC AAGGAATTTT CCTAGAAGTC TTGGAAAGTC 42 0 

TGGGAGAAAT GCCTCTGCCA CCTTATATCC ACGAAAAATT AG AT G AC CG T GAACGTTATC 480 

AAACCGTCTA CGCCAAGGAA AGTGGCTCTG CTGCAGCACC GACTGCTGGT CTTCACTTCA 540 

CCAAAGAACT GCTGGCAGAA ATCCAAGCTA AGGGTGTTCA TCTAGTCTAT CTGACTCTCC 6 00 

ATGTCGGACT CGGAACCTTT AGACCTGTTT CTGTGGATAA TCTGGACGAA CACGAAATGC 660 

ACTCAGAGTT CTATCAACTT TCTGAGGAAG CTGCTGCCAC CCTTCGCTCT GTCAAAAAAA 72 0 

ATGGTGGTCG TGTCATCGCT GTCGGAACCA CTTCTATCCG CACCTTGGAA ACTATTGGTT 78 0 

CCAAGTTTGA TGGGCAAATC CAAGCAGATT CTGGTTGGAC CAATATCTTT ATCAAACCTG 840 

GGTATGAGTG GAAGGTCGTG GATGCCTTCT CAACCAACTT CCACCTGCCA AAATCAACTC 9 00 

TGGTCATGTT GGTTTCTGCC TTTGCAGGCC GTGAATTAGT CTTAGATGCC TACCACCATT 9 60 

CC AT C C AAG A ACACTACCGC TTCTTCAGTT TTGGTGACGC CATGTTTATT TATTGAGAAA 102 0 

GAATTTCTCT AAATCTTCTA ATACCAATAA ATCGCTAAGA TATTATTTCA AAGAACATCT 108 0 

ACAATTGAAA CTCTAGCTAG CTGTAGAAGA GGCCTAGTAC ATTGAAATTA AAATGCTTCC 1140 

CCCTAGCTTC GAAAATATTG C C AT AGATTG CGTTGACTCT CCAAATTGAT TCATCTATAT 1200 
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TTTATTTCAG CTTCCTATAC TTTCTTCGCT GTTTGTAAAT CAAAATGCAA GACACATGAG 12 60 

TAGCACCATA TTTGTTACTC TTATCTGTCC TCTCAAGAGA CTATTATGAG TTATTTCAGA 132 0 

ATCATTCACT ACTTTGACCC TGACTCTCCT TAGTCTCAAA ATCAAAGACT TATACTCTTC 13 80 

AAAAATCTCT TCAAACCGCG TCAACGTCAC CTTGGATTAT ATATGTGatC TGaCTTCGTC 144 0 

AGTTCTATCT ACAACCTCAA AGCAGTACTT TGAGCAACCT GCGACTAGTT TTCTAGTTTG 1500 

CTCTTTGATT TTCATTGAGT ATTAAACAAA AAGTGAACAA ATCTGAATTC TAATGTACAG 15 60 

AAGACTAGGC TTGTTCACTT TTTTATAGTC GCTATAAGAT GACCTTATCT ATAGCTTTTT 162 0 

ATATATAATT AT AT ATT C AG ACATACTATT ATCAATTTTG TCGCAGGGAG GAATCTGTTA 16 80 

ACGCACCCAT T C ACC ATT AT CATTGACTCT AT AG C CAT G T ATACTTGTAT TGACCGCTAA 1740 

CTCACCCGAT GTATTTACAT AATACCATTT ACCACCAACT TGGAACCATT GATTGACTTT 18 00 

C AT AG AAC C G TTGCTGTTGA GGTAGTACCA TGAACTATTA ACTTGTACCC AACCTGTTGC 18 60 

CATGGAACCA TCAGTATTAT AAAAAT AC C A CAT ACC AT TT TCTTGTTTCC AGTCTGTTGT 192 0 

TGGAGCAACT GCTTTAGCTG GTTCTACTGC T AC AT CTGT T CCTTGGTTAG ATGTAACAGA 1980 

TACAGGATAC GAAGGAATAG ATGATTGCTC AGGAACAACA ACTTTTTCAG GTTCTCTCGT 204 0 

CCCTCTCCTT ATACGTCTTT TTACCATCTC TTTAGTAATT TGACGAGAAG TAGTTTCTTC 2100 

AATTGTTCCA TCACGTTCAT CTACAGTATA GATTGTAGTA AGAGTAATTT ACCAATTTCT 2160 

CCTACTTCTT CTACTTCTTG ACTTTTATCA AGAGTTGGGC CATCGAGATA TTCTGTTTCG 2220 

ATTGGAATTT CTTGGACAAG AACTTGGGGC TTGGTTCTTT TTTTAACAAC TCTTGTTTGA 22 80 

GAGTCTTTTT TTTGACTTAA AGTACTCTCA GTTACTTGTC CACTCTTTCC AT CT AC ATT A 234 0 

TAAGTTATCG TTGTAACTGT TTTCCCATTC TTTCCTAGAG TAATCTCTTG CTCCTGTCCT 24 00 

GCAGAAAGGT CATTGTCTGC TTCATATTTA GTAGCAAATG GAACAAGAAC TTCTTCAACC 2 4 60 

TTGCTTTTAG CTGGAACTTT GATAACTGTA TCCGTGGCTT CTTTTCTATC AACAGTAACC 2520 

TGTTCGGTAA CATAACCAGT CTCTGGATTA ACATCGTAGG TCCTTGTCGT AGTTACATAG 258 0 

CCATCCTCTC CATC AAT TGT AACAGGATTT TCACTACGGT CTTTTGTTTC ATCTTTTTCA 2 640 

TAACGAATTC GCGTACTTGA AATTTTCTTG GTTACTACCT TAGGTTTAGT CGCTACTTTT 2700 

ACAATAATAT CCCCATTGTC AGCGTCATCA TACTCTATTC CCTCTTCTTT ATCTCTAGTA 2760 

TCATCTCTGA CATATTGAAT CCCATCAGCA GCATGAACAA AACTTGTATT CAGATTCCTC 2820 

CTAAAAATAA AGTTAGCCCG AT T AC CGC AG AACCAAAAAT CTTTCCGAGT TTACGTATTG 2 880 

CATAGCGCTT ATTAGTATTA GATTTTGCCA TT AC AT C CT A CTTCTAGTAT AGCATCTTTT 294 0 
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CTATCAAACG TTAAACAATA TACGTTATAT ATAAAATAGA CTTAGAATGA TATATTGATT 3 000 

ATTGAACTAA CACTTTAACT ATATCGTAAT CAATCTCATA TATAAAGGAT TGCAGACATC 3 060 

TTATCTAAAT ACATGCGAAT ATATTTAGAT AC AAAC AT T C CAACTTGATA AT 3112 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

CCCAAAAATC TCTTCAAACC ACGTCAGCTT CGCCTTGCCG TAGTATGGTT ACTGACTTCG 60 

TCAGTTCTAT CCACAACCTC AAAACAGTGT TTTGAGCATC ATGCgGCTAG CTTCTTAGTT 120 

TGCTCTTTGA TTTTCATTGA GTATAAAAAC AGATGAGTTT CTGTTTTCTT TTTATGGACT 180 

ATAAATGTTC AGCTGAAACT ACTTTCAAGG ACATTATTAT ATAAAAGAAT TTTTTGAAAC 240 

TAAAATCTAC TATATTACAC TATATTGAAA GCGTTTTAAA AATGAGGTAT AATAAATTTA 3 00 

CTAACGCTTA TAAAAAGTGA TAGAATCTAT TTTTATGTAT ATTTAAAGAT AGATTGCTGT 3 60 

AAAAATAGTA GTAGCTATGC GAAATAACAG ATAGAGAGAA GGGATTGAAG CTTAGAAAAG 420 

GGGAATAATA TGATATTTAA GGCATTCAAG ACAAAAAAGC AGAGAAAAAG ACAAGTTGAA 4 80 

CTACTTTTGA CAGTTTTTTT CGACAGTTTT CTGATTGATT TATTTCTTCA CTTATTTGGG 54 0 

ATTGTCCCCT TTAAGCTGGA TAAGATTCTG ATTGTGAGCT TG AT TAT ATT TC C CAT TATT 600 

TCTACAAGTA TTTATGCTTA TGAAAAGCTA TTTGAAAAAG TGTTCGATAA GGATTGAGCA 6 60 

GGAAGTATGG TGTAAATAGC ATAGGCTGAT GTC CATC ATT TGCTTATAAA GAGATATTTT 72 0 

AGTTTAATTG CAGCGGTGTC CTGGTAGATA AACTAGATTG GCAGGAGTCT GATTGGAGAA 780 

AGGAGAGGGG AAAAT TGGC A CCAATTTGAG ATAGTTTGTT TAGTTCATTT TTGTCATTT A 840 

AATGAACTGT AGTAAAAGAA AGT T AAT AAA AGACAAACTA AGTGCATTTT CTGGAGTAAA 900 

TGTCTTATTT CAGAAATCGG GATATAGATA TAGAGAGGAT CAGTATGAAT CGGAGTGTTC 9 60 

AAGAACGTAA GTGTCGTTAT AGCATTAGGA AACTATCGGT AGGAGCGGTT TCTATGATTG 102 0 

TAGGAGCAGT GGTATTTGGA ACGTCTCCTG TTTTAGCTCA AGAAGGGGCA AGTGAGCAAC 1080 

CTCTGGCAAA TGAAACTCAA CTTTCGGGGG AGAGCTCAAC CCTAACTGAT ACAGAAAAGA 1140 

GCCAGCCTTC TTCAGAGACT GAACTTTCTG GCAATAAGCA AGAACAAGAA AGGAAAGATA 1200 

AGCAAGAAGA AAAAATTCCA AGAGATTACT ATGCACGAGA TTTGGAAAAT GTCGAAACAG 12 60 
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TGATAGAAAA AGAAGATGTT GAAACCAATG CTTCAAATGG TCAGAGAGTT GAT T TAT C AA 132 0 

GTGAACTAGA TAAACTAAAG AAACTTGAAA ACGCAACAGT TCACATGGAG TTTAAGCCAG 13 8 0 

ATGCCAAGGC CCCAGCATTC TATAATCTCT TTTCTGTGTC AAGTGCTACT AAAAAAGATG 144 0 

AGTACTTCAC TATGGCAGTT TACAATAATA CTGCTACTCT AGAGGGGCGT GGTTCGGATG 1500 

GGAAACAGTT TTACAATAAT TACAACGATG CACCCTTAAA AGTTAAACCA GGTCAGTGGA 1560 

ATTCTGTGAC TTTCACAGTT G AAAAACCG A CAGCAGAACT ACCTAAAGGC CGAGTGCGCC 162 0 

TCTACGTAAA CGGGGTATTA TCTCGAACAA GTCTGAGATC TGGCAATTTC ATTAAAGATA 16 80 

TGCCAGATGT AACGCATGTG CAAATCGGAG CAACCAAGCG TGCCAACAAT ACGGTTTGGG 1740 

GGTCAAATCT ACAGATTCGG AATCTCACTG TGTATAATCG TGCTTTAACA CCAGAAGAGG 1800 

TACAAAAACG TAGTCAACTT TTTAAACGCT CAGATTTAGA AAAAAAACTA CCTGAAGGAG 18 60 

CGGCTTTAAC AG AG AAAAC G GACATATTCG AAAGCGGGCG TAACGGTAAC CCAAATAAAG 192 0 

ATGGAATCAA GAGTTATCGT ATTCCAGCAC TTCTCAAGAC AGATAAAGGA ACTTTGATCG 198 0 

CAGGTGCAGA TGAACGCCGT CTCCATTCGA GTGACTGGGG TGATATCGGT ATGGTCATCA 2 040 

GACGTAGTGA AGATAATGGT AAAACTTGGG GTGACCGAGT AACCATTACC AACTTACGTG 2100 

AC AAT CC AAA AGCTTCTGAC CCATCGATCG GTTCACCAGT GAATATCGAT ATGGTGTTGG 2160 

TTCAAGATCC TGAAACCAAA CGAATCTTTT CTATCTATGA CATGTTCCCA GAAGGGAAGG 2 2 20 

GAATCTTTGG AATGTCTTCA CAAAAAGAAG AAGCCTACAA AAAAATCGAT GGAAAAACCT 2 2 80 

ATCAAATCCT CTACCGTGAA GGAGAAAAGG GAGCTTATAC C ATT CG AG AA AATGGTACTG 2 340 

T CT AT AC AC C AG AT GGT AAG GCGACAGACT ATCGCGTTGT TGTAGATCCT GTTAAACCAG 24 00 

CCTATAGCGA CAAGGGTGAT C T AT AC AAGG GTGACCAATT ACTAGGAAAT ATCTACTTCA 2 4 60 

CAACAAACAA AACTTCTCCA TTTAGAATTG C C AAGG AT AG CTATCTATGG ATGTCCTACA 2 52 0 

GTGATGACGA CGGGAAGACA TGGTCAGCTC CT C AAG AT AT TACTCCGATG GTCAAAGCCG 2 580 

ATTGGATGAA ATTCTTGGGT GTAGGTCCTG GAACAGGAAT TGTACTTCGG AATGGGCCTC 2 640 

ACAAGGGACG GATTTTGATA CCGGTTTATA CG AC T AAT AA TGTATCTCAC TTAGATGGCT 2 700 

CGCAATCTTC TCGTGTCATC TATTCAGATG AT C ATGG AAA AACTTGGCAT GCTGGAGAAG 27 60 

CGGTCAACGA TAACCGTCAG GTAGACGGTC AAAAG AT CCA CTCTTCTACG AT G AAC AAT A 2820 

GACGTGCGCA AAATACAGAA TCAACGGTGG TACAACTAAA CAATGGAGAT GTTAAACTCT 2880 

TTATGCGTGG TTTGACTGGA GATCTTCAGG TTGCTACAAG TAAAGACGGA GGAGTGACTT 294 0 

GGGAGAAGGA TATCAAACGT TATCCACAGG TTAAAGATGT CTATGTTCAA ATGTCTGCTA 3 000 
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TCCATACGAT GCACGAAGGA AAAGAATACA T CAT C CT C AG TAATGCAGGT GGACCGAAAC 3 060 

GTGAAAATGG GATGGTCCAC TTGGCACGTG TCGAAGAAAA TGGTGAGTTG ACTTGGCTCA 3120 

AACACAATCC AATTCAAAAA GGAGAGTTTG CCTATAATTC GCTCCAAGAA TTAGGAAATG 3180 

GGGAGTATGG CATCTTGTAT GAACATACTG AAAAAGGACA AAATGCCTAT ACCCTATCAT 3240 

TTAGAAAATT TAATTGGGAA TTTTTGAGCA AAAATCTGAT TTCTCCTACC GAAGCGAACT 3 3 00 

AGAGAGATGG GCAAAGGAGA GATGGGCAAA GGAGTTATTG GCTTGGAGTT CGACTCAGAA 3 3 60 

GTATTGGTCA ACAAGGCTCC AACCCTTCAA TTGGCAAATG GTAAAACAGC GACTTTCCTA 3 420 

ACCCAGTATG AT AG C AAG AC CTTGTTGTTT GCAGTAGATA AGGAAGATAT CGGACAGGAA 34 80 

ATTATTGGTA TAGCTAAAGG AAGCATCGAA AGTATGCATA ATCTTCCTGT AAATCTAGCA 3 540 

GGTGCCAGAG TTCCTGGCGG AGTAAATGGT AGCAAAGCAG CGGTGCATGA AGTTCCAGAA 3 600 

TTTACAGGGG GAGTTAATGG TAC AG AG CCA GCTGTTCATG AAATCGCAGA GTATAAGGGA 3660 

TCTGATTCGC TTGTAACTCT TACTACAAAA AAAGATTATA CTTACAAAGC TCCTCTTGCT 372 0 

CAGCAGGCAC TTCCTGAAAC AGGAAACAAG G AG AGTG AC C TCCTAGCTTC ACTAGGACTA 3 780 

ACAGCTTTCT TCCTTGGTCT GTTTACGCTA GGGAAAAAGA GAGAACAATA AGAGAAGAAT 3 84 0 

TCTAAACATT TGATTTTGTA AAAATGGCTC TTTGTCAACT GTAGTGGGTT GAAGTCAGCT 3 900 

AAGCTCGAGA AAGGACAAAT TTTGTCCTTT CTTTTTTGAT ATTCAGAGCG ATAAAAATCC 3 9 60 

GTTTTTTGAA GTTTTCAAAG TTCCGAAAAC CAAAGGCATT GCGCTTGATA AGTTTGATGA 4 02 0 

GATTATTGGT CGCTTCCAAT TTGGCGTTAG AATAGTGTAG TTGAAGGGCG TTGACGATTT 4 080 

TCTCTTTGTC CTTTAGAAAG GTTTTAAAGA CAGTCTGAAA AAGAGG AT G A ACCTGCTTTA 414 0 

GATTGTCCTC AATGAGTCCG AAAAATTTCT CCGGTTCCTT ATTCTGAAAG TGAAACAGCA 42 00 

AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA ATAGCTCAAA AGCTTGTTTA 4 2 60 

AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGGCG ATAAAAATGT TTATCGCTGA 4 320 

GTTTACG 43 27 
(2) INFORMATION FOR SEQ ID NO : 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 118: 

CTCTGGCCCT GCCACTCCAA CGTTTTGTCA GGGTGCTTTT TTCATAAAGG AGTTCTTATG 6 0 
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TT AG AT AT C A AACGTATTCG TACAGATTTT GAAGCTGTCG CAGAAAAATT AGCTACACGT 120 

GGTGTAGATG CTGCTGTCTT GAATGAAATG AAAGAAATCG ATGCTAAACG TCGTAACATC 180 

TTGGTCAAGG TTGAAACTCT CAAAGCAGAA CGTAACACAG TTTCTGCTGA GATTGCCCAA 240 

GCTAAGCGCA ACAAGGAAAA TACAGATGAC AAGATTGCTG CCATGCAAAA TCTATCTGCT 300 

GAGGTTAAAG CCTTGGATGC TGAATTGGCA GAAATCGATG CTAAATTGAC AGAATTTACA 3 60 

ACGACTCTTC CAAATATCCC AGCTGACAGC GTTCCTGTTG GGGCTGACGA AGACGACAAT 42 0 

GTGGAAGTTC GCCGTTGGGG TACTCCACGC GAGTTTGACT TCGAACCTAA AGCTCACTGG 4 80 

GATCTCGGTG AAGACCTTGG TAT CCTTG AC TGGGAACGCG GTGGTAAGGT AACAGGCGCT 54 0 

CGCTTCCTCT TCTATAAAGG CCTCGGTGCT CGTTTGGAAC GTGCTATCTA CAACTTTATG 600 

TTGGATGAAC ATGGAAAAGA AGGCTATACT GAAGTCATCA CACCTTACAT AGTCAACCAT 660 

GATTCTATGT TTGGTACTGG TCAGTATCCA AAATTTAAGG AAGATACTTT TGAACTCAGC 72 0 

GATACCAACT TTGTCTTGAT TCCAACTGCT GAAGTTCCTC TGACAAACTA CTACCGTGAT 7 80 

GAAATCTTAG ACGGCAAAGA TCTTCCAATC TACTTCACTG CCATGAGTCC GTCATTCCGT 84 0 

TCTGAGGCTG GTTCTGCCGG TCGTGATACG CGTGGCTTGA TCCGTTTGCA CCAATTCCAC 9 00 

AAGGTTGAAA TGGTCAAATT TGCCAAACCA GAAGAATCTT ACGAAGAATT GGAAAAAATG 9 60 

AC AG C C AACG CTGAAAACAT TCTTCAAAAA CTCAACCTTC CATACCGTGT CGTTGCTCTC 1020 

TCTACTGGAG AT AT GGGCTT CTCAGCTGCG AAGACTTACG ACTTGGAAGT GTGGATTCCA 1080 

G C AC AAAAC A ATTACCGTGA AATCTCAAGC TGTTCAAACA CAGAAGATTT CCAAGCCCGT 1140 

CGTGCCCAAA TCCGTTACCG TGATGAAGCA GATGGCAAGG TGAAACTCCT TCATACCTTG 12 00 

AACGGTTCTG GACTTGCAGT TGGACGTACA GTGGCTGCAA TTCTTGAAAA TTACCAAAAT 1260 

GAAGATGGTT CTGTGACCAT CCCAGAAGCA CTTCGTCCAT ACATGGGTGG AGCTGAAGTC 13 20 

ATCAAACCAT AAAAAATAAG GTTTAGCTAT TTCTAGCTAG ACCTTTTTTC GTAACCAAAT 13 80 

CAGATAAGCA CCTAGTACAA AGAATAAAAT AGTTAGGCAT ATAATGGTTT CAGCCAATAC 1440 

CAGGTAATCC AGAAATGGAA GTTTCAAAAT TCCCTGAGCC ATCTTGAGCG AGGTCGCTGT 1500 

GATAATGGTT GGGAAGGTGA GGGCTGAGAA GGCTGGTTGA AAACCTTGTT TTAAAATGTT 1560 

GGGCAGACGA GTTAAAACAA AGAAAAAGAA GGATTGAGAA GCCAAAATCA TGACAATCAA 1620 

GACCCAAGTC GGCAGGCTGG TTCCTCCTAC TCGAACTAGA GAAGCCAAGA GTAGAGAGAA 1680 

AGGAGCACAG T AG ATT C CT T CTTGTCCAAG CAAGGCTAGT GGGAGTGGAT GTTTCTTTAA 174 0 

ATCGCTATAA ATAAGGGGAT AGAGATAGAA GGTCAAGAGA AAACC AAAAC TCAAGGTCGC 18 00 
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ATAGGCAATT TCGATAATAC CTACCAGAGG ATAGGTCAAG GCAGCCACTG CTATCCCCAC 1860 

AT AG AG AAC C GTCCAGCTTG GAGTGGCATG AACCCTCCGC CCTGGACAAG CAAACTTGAT 192 0 

GGTAAAACCA GCAATCAAGG TCAAATCCAA GAGAAATGAA AACCACCAAA TCCCTTGTGC 19 80 

TACCAAAGGA AGATAAGAGA ATACGCGAAA GACATAGGTC G AT AAAAT C A TCCCAGCCAT 2 04 0 

AGGAAAGGTT GCCATTCCTG ACAAAAGAGG GGGCTTGGTC AATTCTTGCT TGGTTTCTTT 2100 

CCAATTAAAG AG AT GC AGAA TTAGAAAGTA AATCCATAAA ACCAAACCAA T C AG AC T AAA 2160 

AAGATGGGAT AGAACCGGCA ACGTATCTAA AATAAGATTT CCAGCTCCTG CCAAACCTAG 2220 

CAAACAACCT G AAAAT ACTA AGGGGAGTTT TTTCATCCTA ACCTCCAATA ATCATGTTAG 2 2 80 

TTTCAGTATA ACATAAAAGC GCTTAAATGA GGATTTAAAA AAACGAGTCC GCTTATTTCA 2340 

GACTTCATTT TACTCAGATA TGAATTAGGC ATAAGGTTGC AATTCTGGAT TAATTGGTGT 24 00 

ATTAGCTAAG TTGTTGGCAT AGT T AC AG AG GATTGCTAGG C T G AC AC C AA AAACCACATC 2 4 60 

CAAGGCATTT TGTTGAGTGT AGCCAGCTTC TAAAAACTCA GACAAGGCTT CATCTCCTAC 252 0 

ACGACCCTTG GTATTGATAA CTGCCAAGGT AAACTTAGCT AGGGTATCCA ATTTAGGATC 2580 

TGTTTCAATT GG AGT AC GAT TGCGAAGAGC TTGAATCAAG T CAT CAT T C A TCTGGATTTG 2 64 0 

TTTGATGGAA AAGGCTGTGT GACCTGCGAC ACAGAAGGCA CAACCATTGG TCACGGCTGC 2 700 

CGTGATTTGC ACCACTTCAC GCTCAACGGG TGTCAGGCTG TTGCGACGGT GGATAGATGA 2 7 60 

GACAATTTGG TAGGCTTCTA AAACAGTCGG GGCATTGGCC AAGAGACCGA TTAGGTTGGG 2 82 0 

AATATAGCCA TTGTTGTCTT TTTCTACTGT TTCAAGAATT TCTTTCACTT CTGCTGGTGC 2 8 80 

TGACTCTACT GTATGGATAG TAAATGTTGT CATAAGATAC CTCTTTTCTT ATTATTGACA 2 940 

CTAATATTAT TGGAAAATCT T AT AAAATC C TGATTCCTAA GTTTATCTAA GATAAAGCTT 3 000 

TATTCTCTCA TAAGATTTTC GTTGTTATAT TAGTTTATCA CACTTCCAAT CACTTGTATA 3 060 

ATATATATTA TATATCAGGC TGATAAAAAT TATTTATAGG CAAAAAAATC ACACGAGCTG 312 0 

TGTGATTCCA TTATTTGTCA AAATACTTTT TAGTTTCAGC AATAACGACT GGCGACAAGA 3180 

CCAAGAGGGC AATCAAGTTT GGCAGAGCCA TCAAGGCGTT AACGATATCT GCGATAATCC 3240 

AGACCATATC CAACTCGATA AATCCTCCTA ACAAGACCAT GAGCACAAAA ACCACACGGT 33 00 

AGAGCCAGAT AAAGCGAAC C C CAAAGAGG A ACTCAAAACA GCGTTCTCCG TAATAGTTCC 33 60 

AACCTAGAAT CGTTGTAAAG GCAAAAAGTA CAAGGAAGAT GGTCAAGAGA GCAGGCCCAA 342 0 

AGTGTGAAAA GTTTGTTGAG AAAGCTGACT GAGTCAAGGC AACCCCATTC AAGTCACCGC 3 4 80 

TCCAAACTCC AGTTACCAAG ATGGTCAAAC CAGTTAGAGT A 3 521 
(2) INFORMATION FOR SEQ ID NO : 119: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 68 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 119: 

AACCTGGGCA AGCAAGCTAA AAGCAATGGG ACCTGGAATC CTAATGGCAA CTGCCGCTGT 6 0 

TGGAGGTTCC CACATTGTAT CCTCAACTCA AGCTGGCGGT TCTTACGGTT GGTCTCTACT 120 

TCTCTTGGTC ATCTTAGCCA ATGTCTTTAA AT AT C C ATT T TTCCGTTTTG GTGCTGAATA 180 

CACAGCTGAT ACTGGAAAGA CTTTGGTTGA AGGTTATGCC GAAAAAGGAA AACTCTATCT 24 0 

CTGGATTTTC TTTATCCTCA ATGTCTTTTC GGCTATGGTC AACACGGCTG GTGTTGCCAT 3 00 

TCTGTGCTCA GCTATCATCG CCAGTGCCTT CCCAATGATT GGACTTAGCA TTACTCAGTG 3 60 

GTCCCTCATT CTCGTTGCAA TCATTTGGGC TAT GC TACT C TTTGGAGGCT ACAAACTTTT 420 

AGACGGCATG GTCAAATGGA TTATGTCTGC CTTAACCATT GCGACTGTTC TTGCAGTTAT 480 

CATTGCGGCG GTCAAGCATC CAGAATACAG TTCTGATTTT GTCGAGAAGA CACCTTGGCA 54 0 

AATGGCAGCT CTGCCCTTCA TCGTCTCCCT CCTAGGATGG ATGCCGGCTC CTATTGAAAT 600 

TTCAGCCATC AATTCACTTT GGTCAGCTGA AAAGAGAAAG ACCGTCAACT TTAACACAGA 6 60 

AGACGCTCTG TTTGACTTTA ACACTGGTTA TATTGGAACA GCTATCCTAG CCGTCTTCTT 720 

TGTGGCACTG GGAGCACTGA TTCAGTATCC TACAGGGCAG GCGGTTGAAG CTGCTTCAGC 7 80 

C AAAT AC AT C TCTCAATTCG TGGGCATGTA TGCCTCTGTT CTTGGCGAAT GGTCCCGTTA 84 0 

CTTGATTACC TTTATTGCCT TCCTCTGTAT CTTTGGAACA GTTATAACTG TTATCGATGG 9 00 

CTATTCTCGC GTTAATCAGG AATCTCTCCG ACTGCTAATC AGTCAAAAAG AGGACAATCG 9 60 

TAAATCTTTG AACATCTGGA TGACCATCAC TGCTATCATC GGTATCGTCA TTATCAAGTT 102 0 

CTTCGCTGGT CAGGTTTCAA CCATGCTCCG CTTTGCCATG ATTGGCTCTT TCCTGACAAC 1080 

ACCTTTCTTT GCTCTTTTGA ATTACGCCTT GGTAACGCGT GAAAACAAAA ATCTTCCTTC 114 0 

TTGGCTCAAA CACCTTGCCA TTGCGGGATT GATTTTCCTC TTTGCTTCGC CATCTTCTTT 12 00 

ATCTACGCAC TCGCAATCGG AAAAGCAGGG TAAGGGACAA GCGCGAGATG AAGATAAGGT 12 60 

TTCATTTCAA GAGAAAATTC AGCAAATATT TCTATGATAA AAAGCATAAG AACAAGGTTT 1320 

TGAAGACCTG AACTTATGCT TTTTTACGTT CTTAAAGACT GTTTATACTC AAAAAACAGT 13 80 

TGAACAACTT CAACCACCTC TTATAAGAAC T TT AT ACT AT TCGAGAATCT CTTCAAACCA 1440 
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CGTCAGCTCT ATCTGCAACC TCAAAGCTGT GCTTTGAGCA ACCTGCGACT AGCTTCCTAG 



1500 



TTTGCTCTTT GATTTTCATT GAGTATTAAT TCTCCTTTTC CAACTCATAC AAATCTGCGA 



1560 



TAATAGCTGC GACATGTTTG ATATCTTCCA GCATGCCTCG CATTTCAAAG TCAGCCAATA 



1620 



CAGGGAAGCC AAAGCGTTGA CTGTATTGCT TGGCTGTTAG GCAGTATTGG TTATTAAAGT 



1680 



TACGATTTCC TGACCCAACC ACACCAAAAC ACTTACTAGC AT TGT T AC C A TAGGCAATAA 



1740 



AATCTCCCAC CGGTGTCGTC AAAATCTCAA CATCTCCGTT ATCCACGCCA TTCCCACCTT 



1800 



CGAGATAGGT CGGCAAAAAA GCGACATAGG GATGGTCCAT T T CAT AG AAA TTTTTGCCTT 



1860 



CCTTGACCAA ATCCTTGATA TGAATCTTTT GAACCTCAAT CCCTTTGTAC TGGGACAAGA 



1920 



GATAGTCTTT CAAGCGCGTC ACAAAACTTT CAGTGTTGCC ACTCAAGG 



1968 



(2) INFORMATION FOR SEQ ID NO : 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 0: 

CCGCATTTTT TATCACTAGA CTCGAGACAT CTTTTGAGTG GCTCTTGCTC TCTGGTTTAA 6 0 

TTTTCTTCCT TGCTCAAGGA CTCCTGCTAT TTCTCTTGGT CGTCCGACTC AAACATCAAT 12 0 

TCGCTGAGAT TTATCCTCAA ATCAATAAAA AGATTCGCTT CTACTATTTA GGGGTTCTCA 180 

CCATTGATTT TCTATTTTTT GTTCTCTTAG C CT T C ATT AG TTCTCAGCGT TTTTCATCTC 24 0 

TTATGCCAAT CATCACTGCT TGCCATTCTA CTTTTTATTA TAT G AC AG C T G ACT AC C T AA 300 

GAGAAAACTA TCCAGACTTT TACGACAAAC ACATCTCTTT ATGGGAGTGT CTCTAAAGAA 360 

AAGGAGGTTT TAG C ATG AAA AAAATC AT CT TCATCAAAAC CATTCAACTC CTTGTCATTG 42 0 

ATGGAATCAT GCTGGCATTT TTGACATTTA AAAGGGGGCT TACTTGGGAC TGGATTTTGA 48 0 

TTTATAGCGG TTGGCTCATT TTCTTTCATC CTGTGCTATT GACCTATCTT TCAAACCAAC 54 0 

TTTGTGACCA CTTTAGTTAA CTCTATTCCC AGATTAGACC GAGATTCTGG CGTTTTGCTT 600 

TACAAATTCT CCTATGGGAT AGCCTGATGA TTCTCTCCTT GGTGTCTTTA AGTGATATTC 660 

CACTTTTCCT TCAGGGAACT CTCCTCATCC TAGGACATCT CATCCCTTCC TATCGCATCT 72 0 

GCCAAAGCCT GAAAAGAGAC TTCCCCCAAG CATATCAAGA ACCGATTTCT TTTTGGAGTA 780 

TTTTATGATA GATGAGAAAG ACCAAGCCGA CTGGGCTTGG TCTTTCTTAT CTCTTTTTAG 84 0 

TATCTAGGAT AATGGTAACA GGTCCATTAT TAACCAGCTC AACCTGCATA TCTGCTCCAA 90 0 
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ALjAI (jLLTGT 


CTGAACGGGC 


ACTTCTTGCG 


CTAATTTTTG 


ATTGAAAGCA 


TCATAGAAGT 


960 


rn 7\ rn /"■■ 7\ m 


ATCAGGTTTA 


GCTGCCCCTG 


TAAAGgCTGG 


ACGATTGCCT 


CTCTTAGTAT 


1020 


LLOLAAAGAG 


GGTAAACTGA 


GAAATAGAGA 


GGATTTCTCC 


TTCAATATCT 


TTGACAGACA 


1080 


GGTTCATCTT 


GCCTTCTGCG 


TCTGAAAAAA 


TCCGCATATT 


GACCAGTTTT 


CTCACAGCAT 


1140 


AGTCCAAATC 


TTCCTCTTGG 


TCCTCTGGTC 


CAACACCAAC 


CAGCAATAAA 


AGTCCCTGAT 


1200 


TGATTTTTCC 


CTGAATCTGG 


CCTT C TAT AC 


TCACTTGGGC 


TTTTTTAACC 


CGTTGGATAA 


1260 


TGATTTTCAT 


AATAGCCTTT 


CTAGTAAGAG 


CTAGGACAAC 


TAGCCGTTGG 


TCCGTTTGAC 


1320 


AGAGTAAAC T 


TCTGGCACAC 


TCTTAATTTT 


ATCGACAACC 


GTGGTCAGTG 


TAGAGAGGTT 


1380 


GGCAATACCG 


AAGgACACAT 


GGATATTAGC 


AAACTTCATA 


TCCTTGGTTG 


GTTGGGCATT 


1440 


GACCGTTGAA 


ATATTCTTGG 


TTGTATTTGA 


AAGAACTTGC 


AGTACATCGT 


TCAACAGTCC 


1500 


TGTACGGTTG 


AG AC CGT AG A 


TATCGATATG 


GGCCATATAC 


TCCTTATTTG 


AGCTAGGGTA 


1560 


CTGGTCTTCC 


CATTCCACAT 


CAAGGAGACG 


TTGCTCGTAG 


TTTTCTTGGG 


CACGCAGGTT 


1620 


CATACAGTCC 


ACACGGTGAA 


TAGCCACACC 


ACGACCCTTG 


GTAATGTAGC 


CAACAATATC 


1680 


GTCACCAGGC 


ACGGGGTTAC 


AAC ACT T AG C 


AATCCGCACT 


AGGAGACCAG 


AAGCACCTTC 


1740 


AATAACCACT 


CCCCCCTCAT 


GCTTGACCTT 


GAGGGTTTCT 


TTATTTTCAA 


CCTTGACCTC 


1800 


GCCACCTTTG 


ACAAGCTCCT 


CTGCCTCAGC 


TTTGGCCTTG 


GCACGCTCTT 


CCTCACGGCG 


1860 


TTCCTTTTCA 


GTCAGACGGT 


TAAAGACGGT 


AATCGCACCG 


ATTTCCCCAA 


AACCAATGGC 


1920 


CGCAAAGAGG 


GAGTCTTCTG 


TCTTGTAACT 


GGTCTTTTGC 


AGAACTTGAT 


CCATGTGGCG 


1980 


CTTGTCCATA 


AATTTATTTG 


CCACATAGCC 


ATTTTCTTGG 


AACTGAGCCA 


TCAGCATCTC 


2040 


ACGACCCTTG 


T TGAC AG AC A 


ATTCCTTATC 


TTGGTTTTTA 


AAGAACTGGC 


GAATCTTATT 


2100 


GCGCGCCTTG 


CTAGTCTTGA 


CCATATTGAG 


CCAGTCACGG 


CTAGGTCCAA 


AGGAGTTCGG 


2160 


GTTGGCGATA 


ATTTCAACCT 


GATCCCCTGT 


CTTTAACTTG 


GTTGTCAGTG 


GAACCATGCG 


2220 


GCCATTGACC 


TTGGCACCAG 


TTGCTTTTTC 


ACCGACCTTG 


GTATGGATTT 


CGTAGGCAAA 


2280 


ATCAATCGGT 


CCTGAATCTT 


TGGGAAGGGA 


ACGGACAGCT 


CCATCTGGGG 


TAAAAACGTA 


2340 


AATCTCCTCA 


GCCAAATAGT 


TTTCCTTAAC 


AGAGTCCACA 


AATTCCTTAG 


CATCATCAGC 


2400 


CTGGTCTTGG 


AGCTCCATCA 


TCTCCTTGAT 


CCAGTTCATT 


CCAATAGCTG 


ATTCCTTGCT 


2460 


GTTAACTTGC 


CCCTTTATAC 


CTTTCTTATA 


AGCCCAGTGA 


GCCGCAACCC 


CGTACTCAGC 


2520 


CACCTCGTGC 


ATTTCCTTGG 


TTCGAATCTG 


GAATTCAATC 


GGCCCTTTTG 


GTCCATAAAC 


2580 


AGTCGTATGG 


ATAGACTGAT 


AACCATTGGC 


CTTGCGGTTG 


GCGATATAGT 


CTTTGAAGCG 


2640 
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ACCTGGCATC 


GGTTTCCAAA 


ATTCATGCAC 


GTAACCAAGC 


ATGGCATAAA 


CATCACTTTG 


2700 


GGTATCTAAA 


ATACAACGAA 


TAGCAATCAG 


ATCATAGATT 


TCCTCAAACC 


GTTTTCTCTT 


2760 


GTCCTGCATT 


TTGCGGAAAA 


TTGAGTAAAT 


ATGCTTGGGA 


CGACCATAAA 


TCTTCCCTTT 


2820 


CAAGTGACGT 


TCTGTCGTAT 


ACTCCTCTAA 


TTTTGTGACT 


ACCTCATCCA 


CCAAGGCCTC 


2880 


ACGCTCCCTG 


CGCTTTTCCT 


T CAT C AT ATG 


GGTAATCTTG 


TAAAACTCCG 


TTGGATTGAG 


2940 


ATAACGGAAA 


GACAAGTCTT 


CTAATTCCCA 


TTTGACACTG 


GAAATCCCCA 


AACGATGGGC 


3000 


AAGCGGGGCA 


TAGATTTCCA 


TGGTTTCTTT 


GGAAATACGC 


TCCTGCTTGT 


CTTTTCGAAG 


3060 


ATGTTTCAGG 


GTCCGCATAT 


TGTGCAAGCG 


GTCAGACAGT 


TTGACCAAAA 


TAACGCGGAT 


3120 


GT C CT C AG AC 


ATGGCCATGA 


GCATCTTGCG 


ATGATTTTCC 


GCTAATTGCT 


CCTCGATCGA 


3180 


TTTGTACTCG 


ACCTTGCCAA 


GCTTGGTAAC 


TCCGTCAACA 


ATCATCCGCA 


CATCAGGACC 


3240 


AAACTCTCTT 


TCCAAATCGT 


CCAAAGTCGC 


ATCTGTATCT 


TCCACCACAT 


CATGCAAGAA 


3300 


TCCACAAGCT 


ACTGTTACAG 


CATCCAGCTT 


TAGCTTAGCT 


AAAATACCTG 


C C ACT TGG AT 


3360 


AGGGTGAATG 


ATATAAGGCT 


CGCCTGATTT 


GCGATATTGA 


CCACTGTGGC 


ATTCAACAGC 


3420 


AT AG AC C AAG 


GCCTTATGGA 


CAAAATGAAC 


ATCCTCTTCC 


GTTAAATATT 


CTTTGGTTAA 


3480 


AG CG AC AACT 


TCTTCGCCTG 


TTAAATTCAC 


TTCTTTCGGC 


ATCTCTACTC 


TCCAATTCTT 


3540 


CCTACCATTT 


TATCACTTTT 


TTAAGAATAT 


G AAAAC T AG A 


TTGGAACAGA 


ATAAGAAAAA 


3600 


AATAATTCAA 


AATTGCTTGA 


TAATTCTGAA 


TTATTGGTCC 


GTAATATACT 


ACGAAGTTAG 


3660 


ATTTTAAACT 


TAGGTGATAG 


AAGGAGAGAT 


AGAAGAACGG 


AAACCATATT 


GTAACCCAAA 


3720 


GACTTTCTGA 


CTTCCCCAAT 


T C CAT TG AAG 


ATACGAAAGA 


TAAACGGTGG 


AACTCGTATC 


3780 


ACATACACTG 


GTACCTTGAC 


TGGATTTTGG 


AATTAATACT 


AAATGAAAAT 


CAAAGAGCAA 


3840 


ACTAGGAAAC 


TAGCCGCAGG 


TTACTCAAAG 


CACCGCTTTG 


AGGTTGCAGA 


TAAAGTTGAC 


3900 


GCGGTTTGAA 


GAGATTTTTG 


AAGAGTATAA 


AAATCCTCAA 


GATACTTTCT 


TCTATCCTTT 


3960 


AGTTTATAAG 


GAGAATACCT 


ATGAAAAAAA 


CTGCTATTTC 


TATCTTTGCT 


CTCCTAATGT 


4020 


TAGGAGTTTG 


CTGCCTGTTC 


CTATTCAGCC 


AGCAAAGCTA 


TAAAAAACAG 


TCGTTCAATA 


4080 


CTATGCTAAC 


GACCAGAACC 


TGCCCAGTAG 


GATAACTTAT 


AGTGAATATA 


GCGACAAATG 


4140 


AGAAGCCAAC 


TACGGTAGCA 


CTCTAAACAT 


CACGTCTATC 


AAACAAGCTA 


ATGACGGAGT 


4200 


TTATGCAACC 


TATGAAGGGC 


AATTGACACC 


TTTCCAATAT 


TGATAAATTG 


ATAACCAGCC 


4260 


TGTCTTCATC 


TAGTCATGCT 


GGTTTTTAAG 


TTCATTTTAA 


ATCCTTACCT 


ATTCTCCCTA 


4320 


ACTGTGCTAT 


ACTTAATTTA 


TACTCAATGA 


AAATCAAAGA 


GCAAACTAGA 


AAGCTAGCCG 


4380 


CAGGCTGTTC 


AAAGCACTGC 


TTTGAGGTTG 


CAGATAAAGT 


TGACGCGGTT 


TGAAGAG AT T 


4440 
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TTCGAAGAGT ATTAGTACAT TCTTTGAGAT TGGAGCTAGT ATGAAAATCC ATAAAACCGT 4500 

GAATCCTGTT G CCT ATG AAA ATACCTATTA TCTAGAAGGC GAAAAGCACC TCATCGTCGT 45 60 

CGATCCTGGT AGTCATTGGG AAGCCATTCG TCAGACAATC GAGAAGATCA ACAAACCGAT 4 620 

CTGTGCTATT CTCTTGACCC ACGCCCATTA TGACCATATC ATGAGTCTGG ACTTGGTTCG 4 6 80 

CGAGACGTTT GGCAATCCTC CTGTCTATAT CGCAGAGAGC GAAGCCAGCT GGCTCTACAC 4 74 0 

TCCTGTCGAT AATCTCTCCG GTCTCCCTCG CCACGATGAT ATGGCAGATG TGGTCACAAA 4800 

ACCTGCAGAA CACACCTTTG TCTTTCACGA AGAATACCAA CTAGAGGAAT TTCGTTTTAA 4860 

GGTTCTACCG ACCCCAGGGC ACTCTATCGG TGGTGTTTCC CTAGTCTTTC CTGATGCTCA 492 0 

TCTAGTCTTG ACGGGAG AT G CTCTATTCCG CGAAACTATC GGACGGACCG ACCTTCCGAC 4 980 

TGGTAGCATG GAGCAACTCC TTCATAGTAT CCAGACCCAA CTCTTCACCC TACCAAACTA 504 0 

CGATGTCTAT CCAGGACATG GTCCAGCTAC TACTATCGCT CACGAAAAGG CCTTCAATCC 5100 

CTTTTTCTAG CAAGATGATG ACAATCGAAA TTTAAGTAAA CTATCCAGCA AATCTTTCTA 5160 

TTACAAAAGG CATCCTATCA AGGTTTTCAC ACATGATTGG ATGCCTTTTT TCTGATGACT 522 0 

AGATTTTTTG CAT T AC C AAA TAATCACGCG CTCCTCTGGT GAACGCCACA TTCCGTCTCC 52 80 

TTCTTTGACA TCATAGGTTG TAAAGAAATC GTCGAAGTTT GGTACTTGCA CATTGACACG 534 0 

GAGTTTGGCT GGTGCGTGCA CATCGACGCT AGCCAAAAGT TTCATAAATT CTGGTCGACC 5400 

TTTCATGCGC CAGATGCGAC CGAAGTTGTA GAAGAACTCT TCTGCTGAGA AGTCTGCTTC 54 60 

TCTCTTAGCT GCTTCAAGCG CTGCTGCGAT TCCTCCCAAG TCAGCCACGT TTTCTGATAC 552 0 

AGTCAATTTA CCGTTAATGG TTGCTCCATA AGAATCCTGT CCATCAAATT GGTCAATGAC 55 80 

TTTTTGTGTT TTCTCCTTGA AGGCAGCATA GTCGCTCTCT GTCCACCAAT CCTTGAGGCT 564 0 

ACCATTTTCG TCAAAGGAAG CCCCGTTAGT ATCAAAGGCG TGGGAAATTT CAT G GGC AAT 57 00 

CACTGCCCCA ATACCACCGT AGT T AG C AG A AG AT G AC T G A TGCAAGTCAT AGAAAGGCGC 57 6 0 

CTGTAAAATG GCCGCTGGAA AG AC AAT C AG GTTCTTCTGA GGATTGTAGT AGGCATTGAC 582 0 

CATATGAGCA GGCATGCCCC ATTCCTTATA ATCTACAGGC TGGTTCCACT TACTCCAACT 58 80 

GTGCTTGATT TCCACACGCG CAAAGGCTAG AGCATTCTCA AAAAGACTGG CAGTTTCATT 5940 

CACTACCTTA TCCTTGTAAC GTGCAGGCAA TTCTTCTGGA TAG C C AAT AT AAGGTTTGAT 6000 

C AC ATTG AG C TTCACGATAG CCTGTTTACA GGTTTCTGGA GTGAGCCAGT CATTCTTAAG 6060 

CAGACGCTCC TTATAAACAT CAATCATGGT TGCCACTTTT TTCTCCACAT CCGCCTTGGC 612 0 

TTCTGGAGAG AACTTCTCAC GGGCGTACCA AAGACCCAGG GCTTGCTTGA AAGGTTCTTG 6180 
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TGCTAGATGA TAAGCTGCTT TGACCTTATC TTTTGCCTCT GGAACTCCAG AAAGGGCACG 624 0 

GCTGTAGGCA CCAGACAAAA CACGGATATC CTCTGTTAAA TAGCTGGTTG AAAGATTGAC 63 00 

AACACTCAAA ATCAAGGTTG CTTTAAGGAG AGACCAGGCT TCCTCACTGT AGAATTGCTC 63 60 

TGCTGCTTGC CAGAAACGTT CCTCGTCTAC AATAACCTTG TCTGGTAATT GCCCAATAAC 642 0 

TGCTTTGAAG AAGTCATCCA AAGGTAGGGC AGGCGCGAAT TTCTTGAAAT CTTCGTAAGA 6480 

ATATGGATGA TAGAGTTTAG CATATTCTGA ACTTTCTTCA TTAGAGAGCA CCACTGCCGC 6540 

AACTCGGCGG TCCAATTCAA GTCTTTTTTC TAGCAAGTCT TCAATTTCTT CATCAGAGAA 6600 

ATCATAAGCC TTGAGGAGAT TTGCGCTGCT TTCTTTCCAA AGAGTCAAGA GCTCTTCGCG 6660 

CTGAGGATGT TCTTCTGCAT AGTAGGTCGT ATCTGGCAAG ATTGTGCTTG GAGCGCTAGC 672 0 

C C AT AGAAC A TTGATTCTAG CAT C CAT AAA GTCTGGCGAT ACACCAAAAG GAAGGAAGTT 6780 

TGGTTTTCCT GCAAGCTCAA ACTCTGCTAG TTTAGCTGTA AAATCCGCAA AAGTCTCCAA 6 84 0 

TTCTTGGAAT TCTTTAAGGA GTGGTAAGAC AGGTGTGATA CCGTCAGCTT CTCTCTTGTC 6900 

AAAAT C AC G A ACTAGGCGGT GGTATTTGAC AAAGTTTTCC AAGATAGCAT CCTCAGGCAC 6 960 

TTCTTCACCT G CT AACC ACT TGTCTGTTGT CGCCAGCATC AGGTCTTCAA TTTCCTGGTC 702 0 

TAAATCAACA AAACCTCCTG TTTGAGACTT ATCTGCTGGG ATTTCAGCTG TCTGTTGCCA 7080 

TTCTCCATTG ATAGCATCAT AAAAATCATC TTGATAACGT GTCATCTTGT TCTCGCTTTC 7140 

ATTTGT AT T T GCATTTATCT TAACAAAAAT CG 7172 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4518 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

CGGGAAGTTA TGCGATCTAG ACTTCGTTCC TGTACAGCTA CTTTCTCAGG TGGTCTTGTT 60 

GTTTGTATGA GTTTGTTTAG AGAGGATCTT TCTATGTCTT TCTTTCTTAT TTTTGTTTTA 12 0 

TATGCTTTTC TGATTTCTTA TCTAATTTAT GGTTATTTCA GACTAAAAAG GAAATACCGA 18 0 

GTAGATGAAT AGCAAGGTTC TAGGTCTTCA GATTGATTTT TAGCACTCTT GATAAAAGAG 240 

TGCTAATTTT TTGAGTTTTT GTCTTGACAT TCTCTTCTAA GGGTGTATAA TAGAATCATG 3 00 

AGTTAGCACT TGGATGCATT GAGTGCTAAT T G AT C AG AC A GAGAGGAGTG ATGAGATGGT 3 60 

T AC AG AG CGT CAGCAGGATA TTTTAAATCT GATTATTGAC ATCTTTACCA AAACGCACGA 42 0 
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ACCTGTCGGA TCAAAAGCCT TGCAAGAGTC TATTAACTCT AGCAGTGCAA CCATTCGTAA 4 80 

TGACATGGCG GAACTAGAAA AACAAGGGTT GCTTGAGAAG GCTCATACTT CAAGTGGTCG 540 

GATGCCAAGT GTTGCTGGTT TTCAGTACTA TGTGAAACAC TCACTGGATT TTGACCGGCT 600 

GGCTGAAAAT GAGGTATATG AGATTGTCAA AGCCTTTGAT CAGGAATTCT TCAAATTGGA 6 60 

GGATATTCTG CAAGAGGCTG CTAACTTACT AACAGACCTG AGTGGCTGTA CGGTAGTGGC 720 

ACTGGATGTT GAGCCGAGCA GGCAACGTTT GACAGCCTTT GATATCGTTG TTTTGGGGCA 7 80 

AC AT AC AGC C TTGGCGGTAT TTACCCTAGA CGAGTCGCGA ACGGTTACTA GTCAGTTTCT 840 

GATTCCAAGG AACTTCTTGC AGGAGGATTT GCTGAAACTG AAGAGCATCA TTCAGGAACG 900 

TTTCCTCGGT CACACCGTTT TAGATATTCA CTACAAGATT CGGACGGAGA TTCCGCAGAT 9 60 

TATCCAGCGT TACTTTACAA CAACGGATAA TGTCATCGAT CTCTTTGAAC ACATCTTTAA 1020 

GGAAATGTTC AACGAAAACA TTGTGATGGC GGGCAAGGTC CATCTCTTGA ATTTTGCCAA 1080 

TCTAGCAGCC TATCAGTTCT TTGACCAACC GCAAAAGGTG GCCTTGGAGA TTCGTGAGGG 1140 

GTTGCGTGAG GATCAGATGC AAAATGTTCG TGTTGCAGAC GGTCAAGAGT CCTGTTTAGC 12 00 

TGACCTAGCG GTAATCAGTA GTAAGTTCCT CATTCCTTAT CGGGGAGTTG GAATTCTAGC 12 60 

CATTATCGGT CCAGTTAATC TGGATTACCA ACAGCTAATC AATCAAGTCA ATGTGGTCAA 13 2 0 

CCGTGTTTTG ACCATGAAGT TGACAGATTT TTACCGCTAC CTCAGCAGTA ATCATTACGA 13 8 0 

AGTACATTAA GATTGAAATC ATTAAAGGAG GCGAACATGG CCCAAGATAT AAAAAATGAA 1440 

GAAGTAGAAG AAGTTCAAGA AGAGGAAGTT GTGAAAACAG CTGAAGAAAC AACTCCTGAA 1500 

AAGTCTGAGT TGGACTTGGC AAATGAACGT GCAGATGAGT TCGAAAACAA ATATCTTCGC 15 60 

GCTCATGCAG AAATGCAAAA TATCCAACGC CGTGCCAATG AAGAACGTCA AAACTTGCAA 162 0 

CGTTATCGTA GCCAGGACTT GGCAAAAGCA ATCTTACCAT CTCTTGACAA CCTTGAGCGT 16 80 

GCACTTGCAG TTGAAGGTTT GACAGATGAT GTGAAGAAGG GCTTGGGGAT GGTGCAAGAA 1740 

AGCTTGATTC ACGCTTTGAA AGAAGAAGGA ATTGAAGAAA TCGCAGCAGA TGGCGAATTT 1800 

GACCATAACT ACCATATGGC CATCCAAACT CTCCCAGCAG ACGATGAACA CCCAGTAGAT 1860 

ACCATCGCTC AAGTCTTTCA AAAAGGCTAC AAACTCCATG ACCGCATCCT ACGCCCAGCA 192 0 

ATGGTAGTGG TGTATAACTA AGATATAAAG CCCGTAAAAA GCTCGCAGTA AAAATAGGAG 1980 

ATTGACGAAG TGTTCGATGA ACACAAGAAA ATCTATCTTT TTTACTCAGA GCTTAGGGCG 2040 

TGTTCGATTC GGCAATTCTG ACGGTAGCTA AAGCAACTCG T C AG AAAACG GCAATCGCTA 2100 

TGGCGTTTGC CTAGCTTCCT T ACT AAC TCG TCGTCGAAAT AAAATCGATT TCGACTCCTC 2160 
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GTGTCGCAAT TTACATAATA GAAAACTTGT CCGAAACGAC AATAAACTAT GAAGAAAGAT 2 22 0 

AAAATATGTT TGGCTTTGTA ATAGTGAGCG AAGCGAACCA AACACGATAC TCTTCGCCGT 2 2 80 

GGCGCTATTT GCGCAAATTT TGAGACCTTA GGCTCAAAGT TTAGTCAAAG AGATTGACGA 234 0 

AGTCAAGCTC TGACGGCGTC GCCACTGTCG CCACTTAAGA AGAGTATCAA AAAGAAAAAT 24 00 

AGAAAATTAA CTAACAAGGA GAAAAACACA TGTCTAAAAT TATCGGTATT GACTTAGGTA 24 60 

CAACAAACTC AGCAGTTGCA GTTCTTGAAG GAACTGAAAG CAAAATCATC GCAAACCCAG 2520 

AAGGAAACCG CACAACTCCA TCTGTAGTCT CATTCAAAAA CGGAGAAATC ATCGTTGGTG 25 8 0 

ATGCTGCAAA ACGTCAAGCA GTTACAAACC C AG AT AC AG T TATCTCTATC AAATCTAAGA 2 640 

TGGGAACTTC TGAAAAAGTT TCTGCAAATG GAAAAGAATA CACTCCACAA GAAATCTCAG 27 00 

CTATGATCCT TCAATACTTG AAAGGCTACG CTGAAGACTA CCTTGGTGAG AAAGTAACCA 2 7 60 

AAGCTGTTAT CACAGTTCCG GCTTACTTCA ACGACGCTCA ACGTCAAGCA AC AAAAG AC G 2 82 0 

CTGGTAAAAT TGCTGGTCTT GAAGTAGAAC GTATTGTTAA CGAACCAACT GCAGCAGCTC 288 0 

TTGCTTATGG TTTGGACAAG ACTGACAAAG AAGAAAAAAT CTTGGTATTT GACCTTGGTG 2 940 

GTGGTACATT CGACGTCTCT ATCCTTGAAT TGGGTGACGG TGTCTTCGAC GTATTGTCAA 3 000 

CTGCAGGGGA CAACAAACTT GGTGGTGACG ACTTTGACCA AAAAATCATT GACCACTTGG 3 060 

TAGCAGAATT CAAGAAAGAA AACGGTATCG ACTTGTCTAC TGACAAGATG GCAATGCAAC 312 0 

GTTTGAAAGA TGCGGCTGAA AAAGCGAAGA AAGACCTTTC TGGTGTAACT TCAACACAAA 3180 

TCAGCTTGCC ATTTATCACT GCAGGTGAGG CTGGACCTCT TCACTTGGAA ATGACTTTGA 3 24 0 

CTCGTGCGAA ATTTGACGAT TTGACTCGTG ACCTTGTTGA ACGTACAAAA GTTCCAGTTC 3 3 00 

GTCAAGCCCT TTCAGATGCA GGTTTGAGCT TGTCAGAAAT CGACGAAGTT ATCCTTGTTG 3 3 60 

GTGGTTCAAC TCGTATCCCT GCCGTTGTTG AAGCTGTTAA AG C TGAAAC T GGTAAAGAAC 342 0 

CAAACAAATC AGTAAACCCT GATGAAGTAG TTGCTATGGG TGCGGCTATC CAAGGTGGTG 3480 

TGATTACTGG TGATGTCAAG GACGTTGTCC TTCTTGATGT AACGCCATTG TCACTTGGTA 3 540 

TCGAAACAAT GGGTGGAGTA TTTACAAAAC TTATCGATCG CAACACTACA ATCCCAACAT 3 600 

CTAAATCACA AGTCTTCTCA ACAGCAGCAG ACAACCAACC AGCCGTTGAT ATCCACGTTC 3 660 

TTCAAGGTGA ACGCCCAATG GCAGCAGATA AC AAG AC TCT TGGACGCTTC CAATTGACTG 3 72 0 

ATATCCCAGC TGCACCTCGT GGAATTCCTC AAATCGAAGT AACATTTGAC ATCGACAAGA 37 80 

ACGGT AT CGT GTCTGTTAAG GCCAAAGACC TTGGAACTCA AAAAGAACAA ACTATTGTCA 3 840 

TCCAATCGAA CTCAGGTTTG ACTGACGAAG AAATCGACCG CATGATGAAA GAT G C AG AAG 3 900 

CAAACGCTGA AGCCGATAAG AAACGTAAAG AAGAAGTAGA CCTTCGTAAT G AAGT AG AC C 3 960 
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AAGCAATCTT TGCGACTGAA AAGACAATCA AGGAAACTGA AGGTAAAGGC TTCGACGCAG 402 0 

AACGTGACGC TGCCCAAGCT GCCCTTGATG ACCTTAAGAA AGCTCAAGAA GACAACAACT 4080 

TGGACGACAT GAAAACAAAA CTTGAAGCAT TGAACGAAAA AGCTCAAGGA CTTGCTGTTA 4140 

AACTCTACGA ACAAGCCGCA GCAGCGCAAC AAGCTCAAGA AGGAGCAGAA GGCGCACAAG 42 00 

CAACAGGGAA CGCAGGCGAT GACGTCGTAG ACGGAGAGTT TACGGAAAAG TAAGATGAGT 42 60 

GTATTGGATG AAGAGTATCT AAAAAATACA CGAAAAGTTT ATAATGATTT TTGTAATCAA 4 32 0 

GCTGATAACT AT AG AAC AT C AAAAGATTTT ATTGATAATA TTCCAATAGA ATATTTAGCT 4 3 80 

AGATATAGAG AATTATATTA GCTGAACATG ATAGTTGTAT CAAAAATGAT GAAGCGGTAA 4440 

GGAATTTTGT TACCTCAGTA TTGTTGTCTG CATTTGTATC GGCGATGGTA CCGTATCTGA 4500 

CGAACGTTCA GCTTATAT 4 518 



(2) INFORMATION FOR SEQ ID NO : 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8145 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doub 1 e 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

TGCTATTTTC GATTCCCTTG GGCGTTTTGA TTGCCTTTGC CTTGCAAGTC CATTGGAAGC 60 

CCCTCCATTA TCTGATTAAC ATTTACATCT GGGTTATGCG AGGAACCCCC TTACTCTTGC 12 0 

AACTGATTTT TATCTATTAT GTGCTCCCAA GTATTGGGAT TCGTTTAGAC CGCCTTCCTG 180 

CAGCTATTAT TGCCTTTGTT CTCAACTATG CAGCTTACTT TGCAGAAATT TTCCGTGGGG 24 0 

GAATTGACAC TATTCCAAGA GGACAGTATG AGGCCGCCAA GGTCTTGAAG TTTAGCCCTT 3 00 

TTGACAGAGT GCGCTATATT ATCTTGCCCC AAGTGACCAA GATCGTTCTT CCTAGTGTCT 3 60 

TTAATGAAGT TATGAGTTTG GTCAAGGATA CTTCTTTGGT CTATGCTCTC GGAATTTCAG 420 

ACCTTATCTT GGCTAGTCGA ACAGCTGCTA ACCGCGATGC TAGTCTAGTT CCTATGTTCT 480 

TGGCAGGAGC CATTTATTTG ATTTTGATTG GGATTGTGAC AATTATTTCC AAAAAAGTTG 540 

AGAAGAAGTA TAGTTATTAT AGATAGGAGG CTGCCATGTT AGAATTACGA AATATCAATA 600 

AAGTCTTTGG AGACAAACAA ATCCTGTCTA ATTTCAGTCT AAGTATTCCT GAAAAGCAAA 6 60 

TCCTGGCTAT CGTTGGACCT TCTGGTGGAG GTAAGACAAC TCTTTTACGT ATGCTTGCAG 72 0 

GTCTTGAAAC CATTGATTCA GGGCAAATCT TTTATAATGG ACAACCTTTA GAGCTGGATG 780 
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AATTGCAGAA GCGCAATCTA CTGGGATTTG TCTTCCAAGA TTTTCAACTA TTTCCTCATC 840 
TATCAGTTCT GGAAAATTTG ACTTTATCGC CTGTGAAGAC CATGGGAATG AAGCAGGAAG 900 
AGGCTGAGAA GAAGGCGAGT GGACTCTTGG AACAGTTAGG ACTAGGAGGA CACGCAGAGG 9 60 

CCTATCCTTT CTCACTATCT GGTGGGCAAA AGCAGCGGGT GGCTTTGGCG CGTGCTATGA 1020 

TG AT TG AC C C AGAAATCATT GGCTACGATG AACCAACTTC TGCCCTGGAT CCAGAATTAC 1080 

GTTTGGAAGT GGAGAAGCTA ATCTTGCAAA ATAGGGAACT TGGGATGACC CAGATTGTGG 1140 

TTACCCATGA TTTGCAGTTT GCTGAAAATA TCGCAGATGT ATTATTGAAA GTAGAACCTA 12 00 

AATAGGAGGA AAAATGGATG AAAAAATGGA TGCTTGTATT AGTCAGTCTG ATGACTGCTT 12 60 

TGTTCTTAGT AGCTTGTGGG AAAAATTCTA GCGAAACTAG TGGAGATAAT TGGTCAAAGT 1320 

ACCAGTCTAA CAAGTCTATT ACTATTGGAT TTGATAGTAC TTTTGTTCCA ATGGGATTTG 13 80 

CTCAGAAAGA TGGTTCTTAT GCAGGATTTG ATATTGATTT AGCTACAGCT GTTTTTGAAA 1440 

AATACGGAAT CACGGTAAAT TGGCAACCGA TTGATTGGGA TTTGAAAGAA GCTGAATTGA 1500 

CAAAAGGAAC GATTGATCTG ATTTGGAATG GCTATTCCGC TACAGACGAA CGCCGTGAAA 1560 

AGGTGGCTTT CAGTAACTCA TATATGAAGA ATGAGCAGGT ATTGGTTACG AAGAAATCAT 1620 

CTGGTATCAC GACTGCAAAG GATATGACTG G AAAG AC AT T AGGAGCTCAA GCTGGTTCAT 16 80 

CTGGTTATGC GGACTTTGAA GCAAATCCAG AAATTTTGAA GAATATTGTC GCTAATAAGG 17 4 0 

AAGCGAATCA ATACCAAACC TTTAATGAAG CCTTGATTGA TTTGAAAAAC GATCGAATTG 1800 

ATGGTCTATT GATTGACCGT GTCTATGCAA AC TAT TAT T T AGAAGCAGAA GGTGTTTTAA 18 60 

ACGATTATAA TGTCTTTACA GTTGGACTAG AAACAGAAGC TTTTGCGGTT GGAGCCCGTA 192 0 

AGGAAGATAC AAACTTGGTT AAGAAGATAA ATGAAGCTTT TTCTAGTCTT TACAAGGACG 19 80 

GCAAGTTCCA AGAAATCAGC CAAAAATGGT TTGGAGAAGA TGTAGCAACC AAAGAAGTAA 2 04 0 

AAGAAGGACA GTAAGATAAA ATAGTGGCTG AAACTGCGTT TTGATTAGCA AAACGTAGTT 2100 

TTTTTTGTAA TCTAGGAAAA CGATAATAGC GATTGAATAT GGATAATTGA AT AT GG AAT A 2160 

GCCCACTGTG ATTTCTAAAA CATTGTTAAA AAT TG AT T T G ACTTCCAAAA TTAAAATGTT 2 220 

CTGTAATGAA ATACTGATGT AACTGTTTTA GGAACAATAA AACGCATAAT AT C AAGGTTT 22 80 

TTGCACCTTA CATTATGCGT TTTTGTGATT TTAAGACTTG TTAGCTGATT TTTTACAATC 2 340 

CTGCGAAATC TTTGATTTCT TGTGCTGACA TTGAAGAGTC GCAACGGACG TTGATTTGTC 24 00 

CATCTGTAAT ATGAACAAAA CCTGGTACAG TTGGGATTCC ATAGCGTGAG CGGAATGCTT 2 460 

GCAAATCATT GAGTTGGCTT GGTTCTTCAC TATTGATGAA GTAAATGTGA GCTTTGGTTT 2520 

CAGCTACGAC AC C TG AC AAT GTACCTGCAA ATTTACGGCA GTAAGGGCAA GTTTTGCGAC 2 580 
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CGATAAAGAA GGTTGCAGTT TCTTTTTTAT CAAGAGCTTC TTGCGCACGC ACAACTGTAG 2 64 0 

TGACTTCAAG GTCTTTGATG TTATCTAAAA ATTGTTCCAT GAG AT T AC C T CGCTTTCATT 2700 

GATAAGTCTA GTATGCCATA AAGTTTCTAA AATTGCTTAG ATTTGATACG AAAAAAGATG 2 7 60 

AGGTTGGTTG GTCTCATCTT TTATAGGTCT TTATTTTACA AATGCATTGA TTTCTGCTTC 2 82 0 

GATGTTAGCA ATCTTAGCTT GTGATTCTTC GTTGGTTTCC CCTACAACTG CAATGTAGAA 2 880 

CTTGATTTTT GGTTCTGTAC CTGAAGGGCG AACGGCAATC CATGAACCGT CAGCAAGTGT 2 94 0 

GTATTTCAAC ACATCACTTG GAGGAGTTGT CAAGTTTGTA ACAGTACCGT CAGCAACAGT 3 000 

AGCAGTTTGT GCCTTGAAGT CTTCTACGAC AG T GAT AG C T GTTGCGTTCC ATTCTGTTGG 3 0 60 

AGCATTGTTG CGGAATTTAG CCATAATCGC TTTGATTTGT TCAGCACCAT CGACACCTGA 312 0 

AAGAGTAACA GAG AT T GT TT TTTCTGCGTA GTAGCCATAT TCTTTATAGA TTTCTTCGAT 3180 

ACCGTCAGCA AGTGTCAAAC CACGAGAACG GTAGTAGGCA GCAAGTTCAG CAACTACAAG 3 24 0 

AACGGCTTGG ATGGCATCTT TATCACGTAC AAATGGTTTA ATCAAGTAAC CGAAGCTTTC 3 300 

TTCAAATCCC ATCATGTAAG TGTGGTTGTG TTTTTCTTCG AATTCTTGGA TTTTTTCAGC 3 3 60 

GATAAATTTG AAACCTGTCA AGACGTTGAA CATAGTTGCG CCGTAGCTTT CAGCAATCTT 3 42 0 

CGTTACCAAG TCAGTTGAAA CGATAGATTT G C AG AG AGCG GCATTTTCAG GAAGAGTTCC 3 4 80 

AGCGTTTTTG TGAGCTTCCA AGATGTATTT AGCCATGATA G C AC C G ATT T GGTTACCTGA 3 54 0 

AAGGTTGAGG TAGCTACCAT CTTTTTGAAG AACTTCAACA CCAACACGGT CAGCGTCTGG 3 600 

GTCAGTTGCG ACAAGAACAT CTGCACCAAC TTGACGACCA AGTTCTTCAG CAAGGGCAAA 3 660 

GGCTGCTTGG CTTTCTGGGT TTGGAGATGT TACAGTTGAA AAGTCTGGGT CAGCAGTTGC 3 72 0 

TTGCGCTTCA ACAACTTGAA CAGAGTCAAA TCCTGCTTGG GCAAGAGCAC GACGAGCCAA 37 8 0 

CATTTCACCA GTACCATGAA GTGGTGTGTA GACAATCTTC ATGTCTTTAC CAAATTCTTC 3 84 0 

AATCAAGGCT GGGTTGATGT TTATGTCCTT AACCTCTTTA AGGTATTCTA TGTCAACAGC 3 900 

TTCGCCGATA ACTTCAATCA AGCCAGAAGC TTTTTCAGTT TCCACATCAG CAACTTCAAC 3 9 60 

TGCAAATGGG TTTTCGATTG CACGGATATA AGTAGTCAAA GCGTCCGCAT CGTGTGGAGG 4 020 

CATTTGTCCA CCGTCTTCAC CGTAAACCTT GTAACCGTTA AATGGAGCAG GGTTGTGGCT 4 080 

GGCTGTGACC ATGATAC CTG CGAAACAGTT GAGATGACGA ACTGCAAATG ATAGTTCTGG 4140 

AGTCGGACGA AGGCTTTCAA ATACGTAAGA TTTGATGCCG TGTTTAGCAA GAACTGCCGC 42 00 

AG AT TC AAAG GCAAACTCAG GTGAGAAGTG ACGGCTATCG TAGGCAATTG CTACACCGCG 42 60 

TTCTTTCTCG TTTCCACCTT TTGACTCAAT CAAACGAGCC AATCCTTCAG TAGCTTGGCG 432 0 
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AACAACGTAG 


ATGT T GAT AC 


GGTTTGTACC 


AGCACCAACC 


AAGCCACGCA 


TACCTGCAGT 


4380 


ACCAAATTCA 


AGATTTGTAT 


AGAAGGCATC 


TTCCTTAGTT 


TTTTCGTCCA 


TATTTTCCAA 


4440 


ATCTTGACGA 


AGGTAGTCAC 


GAAGCTCCAC 


AAAATCAACC 


CATTTCTGGT 


AATTTTCTTG 


4500 


GTAAGACATT 


CAAATTCTCC 


TTTATTTTTA 


AAACATTTAA 


TCAGTTTAAT 


TATATCATTT 


4560 


TTTTTAGTTT 


TAGTAAAACC 


TTATCTGCTT 


CGAACATCTC 


TTCAAACCAG 


GTCAGATTGA 


4620 


ATTTTGGGGT 


TATATGATGT 


TGAGGCTAGG 


AAAAATTCAA 


TTTCAGTAAA 


AAAAGTAAGT 


4680 


CTTCTCATAA 


CAAAACATTG 


ATATAGTTAC 


TTAGTTTTAA 


ACAAGCATAT 


TATAATAAAG 


4740 


CTATGGCATA 


TAGTACTGAT 


TTTAAACAGC 


GAGCATTAGA 


TTACATCAAA 


GAGGGGCACA 


4800 


GCCATGTCGA 


GGCAGCCAAG 


TTTTTTGGTG 


TTGGCGTCAG 


AACTCTCTTC 


AC G T GGGAAA 


4860 


AGAAAGACGT 


GAACAAGAAC 


ACATAGAGAG 


GAAAAAGCGA 


GTCGTCAAAA 


ACCGAAAGAT 


4920 


TCCTTTAGAG 


GAATTGAAAG 


CCTTTGTAGA 


GGCTCATCCA 


GATGCTTTTT 


TACGGGAAAT 


4980 


TGCGGCACAT 


TTTGATTGTG 


CTGTTCCTTC 


AGTATGGGCA 


GCTTTAAAGC 


AGATTAAGGT 


5040 


CACTTTAAAA 


AAAGATGACG 


AGCTTTAAGG 


AACAAGACCC 


AGAAAAGTAG 


CCTTATTTCT 


5100 


TAAGAATTTT 


AATAGTTTAA 


AGCACCTAGC 


ACCTGTTTAT 


AT TG AT G AAA 


CAGGAAT CGA 


5160 


CCGCTATCTC 


TATCGTCCTT 


ATGCAGGGGC 


TCCTAGAGGG 


GAGAAAGTCT 


ATGAAAAGAT 


5220 


TAGCGGACGT 


CGTTTTGAGC 


GAACTTCAAT 


TGTTGCAGGA 


CAAGTAGACG 


GAGAGTTTAT 


5280 


AGCTCCCATG 


ATTTACAAGA 


AAAGCATGAC 


AAGCGATTTC 


TTTGTGGAGT 


GGTTCAAAAC 


5340 


GCAACTCCTA 


CCTGCTTTGA 


AGACACCTCA 


TGTTATTGTC 


ATGGGCAATG 


CTGGTTTTCA 


5400 


TCCCAAGAAC 


ATTTTGGATG 


AACTCTGCAT 


C C AAG AT AAA 


CACTTTTTCT 


TACCTCTACC 


5460 


ACCTTATTCA 


CCGGATTTGA 


ATCCTATTGA 


GCAAGCTTGG 


GCTATCTTGA 


AAAAGAAAGT 


5520 


GACGGATGTA 


TTAAGGGAAG 


TTCCAACTAT 


TTTTGAATGT 


TTGGAATGCT 


TTTTTAAAAC 


5580 


TAGATGACTA 


TAACGGTTCT 


AAAGGAACCT 


ATCGAGTAGT 


CATTAAAACT 


AAGGATACTG 


5640 


CTGGTTAAGA 


GAAGACGGTA 


TACAATCAAA 


CCATTCACCG 


TGTAGCCGAA 


ATCGTTCAGA 


5700 


ATGAAGACTT 


GTATCAGAAT 


GAAGACTTGT 


ATAAGAAAGG 


TTTGAATGTT 


GAACTTGCGC 


5760 


ACCAACAAAT 


TAAGGGATTT 


TTTGAAGCAG 


AGTTTAAAAA 


TCGTATTAAT 


GGAGTTCTTA 


5820 


ATACTAAAAT 


AAAAAATAGT 


ACATTAAATC 


GTGTAAATAA 


AAAAACTATA 


CACCAGAGCA 


5880 


AC AAAAAC TC 


CATGATCAAT 


TTGAAGCAGA 


AGCAACGGAA 


GATGCTAAAA 


AACAAGGCGA 


5940 


TATTGTGTTG 


AATGTTGACC 


AGGATTTCAT 


GAGCATATCT 


AAGTCTAATA 


AAAGTGGTTC 


6000 


AGACTGGAAG 


AAAACTTTCA 


CAGTGAGGAT 


AACCAATAGG 


CTAGCAAATG 


ACTTGAATAA 


6060 


TGTCTTGAAA 


CAGGTTGATA 


AAG AT ACT C C 


TAATACCCCA 


ACTTGGCTAA 


ACTCAGCTGC 


6120 
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TTCTAAAGCT 


AAAGATGATG 


ACAGAGTATA 


TAAACTACTG 


AAGACTCTTA 


TACCAGGAGA 


6180 


AAATTACCTA 


TCATGTTAAG 


GATAATCAGC 


TAGAAGTAGA 


AACAGATAAA 


TACACATATA 


6240 


CTGCCGCTAG 


AAATGGTAGT 


AAGGAAGTTG 


GTATTCAAGA 


GT C AG AT AT A 


GCAGCAACTC 


6300 


TAAGTGCCGA 


TGAATATAAT 


TCTAATCGCC 


AAACTTTTGA 


GAGAGAATAC 


AAATACAAAA 


6360 


GCAAATGCCC 


TTAATAATGG 


TTGGGCTAGA 


TCTGGTTCTG 


AAGAGTTCAA 


AAAGTTCTCC 


6420 


CACTTTGTAG 


GGGTAGACAA 


AGGGATTGTG 


CGAACGAATG 


TACTGACTGG 


TAAAAAACTA 


6480 


TCTGATAAGA 


TTAGGAAAGA 


AGTGGGCTCT 


GGAGATAGCA 


AACTAGGAAA 


AGGCGGCTAT 


6540 


TTCTCTACTG 


GGGATGTTCT 


ATTAGGAAAA 


GATGTTGTTT 


CTTATACCGT 


ACAAGTATTT 


6600 


TCAGAGAATA 


ATGAAAGAGT 


AGGAGTAAAC 


ACTCAAAGTC 


ACCGTGTTCA 


GTATAATCTC 


6660 


CCAATTCTAG 


CTGACTTTTC 


AGTCATCCAA 


GATACTGTGG 


AACCATCACG 


AACCGTTGTT 


6720 


GAAAAAATCA 


TTCCAAAACT 


AAATATTCCC 


GAAGAAGAGA 


AAGGGAAAAT 


AACCGAAGAA 


6780 


ATCAAGAAAA 


AGAAAAAAAC 


CTCAGAATTG 


GCAGAACTAA 


TCTCAGAAAA 


TGTGAAAGTT 


6840 


CGCTATGTTG 


ATGAACAAGG 


GCGTTTGCTA 


TCATTGAAAA 


ATGATACTGG 


AATTGGAGAA 


6900 


AAAGAAAGTG 


ACGGAACCTA 


CATTACCAAT 


AAAAAACAAC 


TGATTGGTAC 


CAGCTATAAT 


6960 


GTCACAGATA 


AAAAACT C AG 


TAGCATGACT 


ACTACTGACG 


GAAAATATTA 


TACTTTTAAA 


7020 


GAAGCAGATA 


CAAATTCTGC 


AAGTTTAACT 


GGGAATATTG 


TAAGCGAAGG 


TAGAACAGTG 


7080 


ACCTTAGTTT 


ATAGAGAAAG 


CGAAGCGCCA 


ACCACTGCTA 


CAGTAACAGC 


CAATTACTAT 


7140 


AAAGAAGGTA 


GGCAAGAGAA 


GTTGGTAGAG 


TCTGTTATAA 


AAGCTGATTT 


AGCGATAGGT 


7200 


TCTGAGTATA 


CCACAGAATC 


AAAAACT AT T 


GAAGGGAAAA 


CAACAACTGA 


GGACAAAGAA 


7260 


GACCGAGTTA 


TCACAAGGAA 


AACAACATAC 


ACCTTGGTAG 


CAACTCCTGA 


AAATGCGTAC 


7320 


CAGAAGACGG 


TGCAACAGTT 


GAC TAT TACT 


ACCGTGAGAA 


TGTTGAGGAA 


ACAGTGGTTC 


7380 


CCAAAACAGC 


AACCTCTACT 


GAGACGAAGA 


CTATAACGCG 


TAT CAT T CAT 


TACGTTGATA 


7440 


AAGTTACGAA 


CCAAAATGTA 


AAAGAAGATG 


TTGTTCAACC 


TGTAACCTTA 


AGCCGTACAA 


7500 


AAACTGAGAA 


CAAGGTCACG 


GGAGTTGTAA 


CCTACGGTGA 


ATGGACAACA 


GGAAACTGGG 


7560 


ACGAGGTTAT 


ATCTGGTAAG 


ATTGACAAGT 


AC AAAG AT C C 


AGATATTCCA 


ACAGTTGAAT 


7620 


CACAAGAAGT 


TACGTCAGAC 


TCTAGTGATA 


AAGAAATAAC 


GGTAAGGTAT 


GACCGTTTAT 


7680 


CAACACCAGA 


AAAACCAATC 


CCACAACCAA 


ATCCAGAGCA 


TCCAAGTGTT 


CCGACACCAA 


7740 


ACCCAGAACT 


ACCAAATCAA 


GAG ACT C C AA 


C AC C AG AT AA 


ACCAACTCCA 


GAACCAGGTA 


7800 


CTCCAAAAAC 


TGAAACTCCA 


GTGAATCCAG 


ACCCAGAAGT 


TCCGACTTAT 


GAGACAGGTA 


7860 



WO 98/18931 



PCT/US97/19588 



858 

AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AAGCTAATGC TACCTTGGCT AGTGCTGGTA 



7920 



TCATGACCTT GTTAGCTGGT CTAGGATTAG GATTTTTCAA GAAAAAAGAA GAT G AAAAAT 



7980 



AATAGATTTT AGAATCTAGG AACCAGGAAA AGC T C AC AG A TGTGGGCTTT TTTCCTGGTT 



8040 



TTGAGAACGA GGTCTTTCGT AAAGAATAAA AACGCTTACA AGTCTGTTGA ACTGGGAAAC 



8100 



TATGAATCCT ATTTTTTTAA AAATATTTCC AGAAATCAGT TGCGG 



8145 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8697 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
CGGTACCGGG AACGATACTT AGTCTAATTT TGCACCTTTT CCATGTATGG TAAAGGTTTT 60 

TCTTTTTTTA AAAAGGAAAA CGAGAAGAGG AGGTTCTTAT GAAAGCAAGC ATTGCCTTGC 12 0 

AAGTTTTACC CCTAGTACAG GGGATTGATC GGATAGCTGT TATTGATCAG GTCATTGCTT 180 

ATCTGCAwAC TCAAGAAGTG ACGATGGTAG TGACACCATT TGAAACGGTC TTGGAAGGGG 2 40 

AGTTTGATGA GCTTATGCGC AT TCT AAAAG AAGCGCTGGA AGTGGCAGGG CAGGAGGCAG 300 

ACAATGTCTT TGCCAATGTC AAAATAAATG TAGGAGAGAT TTTAAGTATT GATGAGAAAC 360 

TTGAGAAGTA TACTGAGACG ACACATTAGT CTATTGGGCT TTCTCGGAGT ATTGTCAATC 42 0 

TGGCAGTTAG CAGGTTTTCT TAAACTTCTC CCCAAGTTTA TCCTGCCGAC ACCTCTTGAA 4 80 

ATTCTCCAGC CCTTTGTTCG TGACAGAGAA TTTCTCTGGC ACCATAGCTG GGCGACCTTG 54 0 

AGAGTGGCTT TACTGGGGCT GATTTTGGGA GTTTTGATTG CCTGTCTTAT GGCTGTGCTC 600 

ATGGATAGTT TGACTTGGCT CAATGACCTG ATTTACCCTA TGATGGTGGT C ATT C AG AC C 6 60 

ATTCCGACCA TTGCCATAGC TCCTATCCTG GTCTTGTGGC TAGGTTATGG GATTTTGCCC 72 0 

AAGATTGTCT TGATTATCTT AACGAC AAC C TTTCCCATCA TCGTTAGTAT TTTGGACGGT 7 80 

TTTAGGCATT GCGACAAGGA TATGCTGACC TTGTTTAGTC TGATGCGGGC CAAGCCTTGG 840 

CAAATCCTGT GGCATTTTAA AATCCCAGTT AGCCTGCCTT ACTTTTATGC AGGTCTGAGG 9 00 

GTCAGTGTCT CCTACGCCTT TATCACAACT GTGGTATCTG AGTGGTTGGG AGGTTTTGAA 9 60 

GGTCTTGGTG TTTATATGAT TCAGTCTAAA AAACTGTTTC AGT AT GAT AC CATGTTTGCC 102 0 

AT TAT T ATT C TGGTGT CGAT TATCAGTCTT TTGGGTATGA AGCTGGTCGA TATCAGTGAA 1080 

AAATATGTGA TTAAATGGAA ACGTTCGTAG AATTAGAATG TTTCTGAAAA AGAAAAGAGG 1140 
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AAATCAAAAT GAAGAAAACA TGGAAAGTGT TTTTAACGCT TGTAACAGCT CTTGTAGCTG 12 00 

TTGTGCTTGT GGCCTGTGGT CAAGGAACTG CTTCTAAAGA CAACAAAGAG GCAGAACTTA 12 60 

AGAAGGTTGA CTTTATCCTA GACTGGACAC CAAATACCAA CCACACAGGG CTTTATGTTG 132 0 

CCAAGGAAAA AGGTTATTTC AAAGAAGCTG GAGTGGATGT TGATTTGAAA TTGCCACCAG 13 80 

AAGAAAGTTC TTCTGACTTG GTTATCAACG G AAAGGC AC C ATTTGCAGTG TATTTCCAAG 144 0 

AC T AC AT GG C TAAGAAATTG GAAAAAGGAG CAGGAATCAC TGCCGTTGCA GCTATTGTTG 1500 

AACACAATAC ATCAGGAATC ATCTCTCGTA AAT C TG AT AA TGTAAGCAGT CCAAAAGACT 15 60 

TGGTTGGTAA GAAATATGGG ACATGGAATG ACCCAACTGA ACTTGCTATG TTGAAAACCT 1620 

TGGTAGAATC TCAAGGTGGA GACTTTGAGA AGGTTGAAAA AGTACCAAAT AACGACTCAA 16 80 

ACTCAATCAC ACCGATTGCC AATGGCGTCT TTGATACTGC TTGGATTTAC TACGGTTGGG 1740 

ATGGTATCCT TGCTAAATCT CAAGGTGTAG ATGCTAACTT CATGTACTTG AAAGACTATG 1800 

TCAAGGAGTT TGACTACTAT TCACCAGTTA TCATCGCAAA CAACGACTAT CTGAAAGATA 186 0 

ACAAAGAAGA AGCTCGCAAA GTCATCCAAG CCATCAAAAA AGGCTACCAA TATGCCATGG 192 0 

AACATCCAGA AGAAGCTGCA GATATTCTCA TCAAGAATGC ACCTGAACTC AAGGAAAAAC 19 80 

GTGACTTTGT CATCGAATCT CAAAAATACT TGTCAAAAGA ATACGCAAGC GACAAGGAAA 2040 

AATGGGGTCA ATTTGACGCA GCTCGCTGGA ATGCTTTCTA CAAATGGGAT AAAGAAAATG 2100 

GTATCCTTAA AG AAG AC TTG ACAGACAAAG GCTTCACCAA CGAATTTGTG AAATAATGAC 2160 

AGAAATTAGA CTAGAGCACG TCAGTTATGC CTATGGTCAG GAGAGGATTT TAGAGGATAT 222 0 

CAACCTACAG GTGACTTCAG GCGAAGTGGT TTCCATCCTA GGCCCAAGTG GTGTTGGAAA 2 2 80 

GACCACCCTC TTTAATCTAA TCGCTGGGAT TTTAGAAGTT CAGTCAGGGA GAATTGTCCT 2 340 

TGATGGTGAA GAAAATCCCA AGGGGCGCGT GAGTTATATG TTGCAAAAGG ATCTGCTCTT 24 00 

GGAGCACAAG ACGGTGCTTG GAAATATCAT TCTGCCCCTC T TG AT TC AAA AGGTGGATAA 24 60 

GGCAGAAGCT ATTTCCCGAG CGGATAAAAT TCTTGCGACC TTCCAGCTGA CAGCTGTAAG 2 52 0 

AGACAAGTAT CCTCATGAAC TTAGCGGTGG GATGCGCCAG CGTGTAGCCT TACTCCGGAC 2 5 80 

CTACCTTTTT GGGCACAAGC TCTTTCTCTT AGATGAGGCC TTTAGCGCCT TGGATGAGAT 2 640 

GACAAAGATG GAACTCCACG CTTGGTATCT TG AG AT TC AC AAGCAGTTGC AGCTAACAAC 2700 

CCTGATCATC ACGCATAGTA TTGAGGAGGC CCTCAATCTC AGCGACCGTA TCTATATCTT 27 60 

GAAAAATCGC CCTGGGCAGA TTGTTTCAGA AATTAAACTA GATTGGTCTG AAGATGAGGA 2 82 0 

CAAGGAAGTC CAAAAGATTG CCTACAAACG TCAAATTTTG GCGGAATTAG GCTTAGATAA 2 880 
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GTAGAAAAAT AGGGAGTTGG TGAAGATTAT CCTTTACCAG CGCCCTTTTT CTTTTAAAAA 2 94 0 

TGAGAAAATT TCGGTATAAT AGTCAAACAA GGTCAAGGTT TAAAGAGAGA GGTGGGTTTG 3000 

TTATGAGATT TAAAAATACA T C GG AT CAT A TTGAGGCCTA CATCAAGGCG ATTTTAGATC 3 060 

AATCTGGTAT CGTGGAGTTG CAACGGAGTC AGTTGGCAGA TACCTTTCAG GTTGTTCCTA 3120 

GTCAGATTAA CTACGTGATC AAGACACGCT TTACGGAAAG TAGAGGCTAC TTGGTTGAAA 3180 

GTAAGCGTGG TGGCGGAGGC TACATTCGTA TAGGACGGAT TGAGTTTTCT AGTC AT CAT G 324 0 

AAATGCTCCG GGAGCTGCTT TACTCGATTG GTGAGCGAGT CAGTCAAGAA ATTTATGAGG 33 00 

ATATTCTCCA GCTTTTGGTT GAGCAGGAAT TGATGACCAA GCAGGAGATG AATTTGCTAG 33 60 

AATCAGTAGC TTTGGATCGC GTTTTAGGAG AAGAAGCTCC AGTTGTTCGA GCAAACATGC 3420 

TACGTCAGAT CATACAAGAG GTAGATAGAA AAGGGAAGTA AGATGAACTA TTCAAAAGCA 34 80 

TTGAATGAAT GTATCGAAAG T G C CT AC AT G GTTGCTGGAC ATTTTGGAGC TCGTTATCTA 3 54 0 

GAGTCGTGGC ACTTGTTGAT TGCCATGTCT AATCACAGTT AT AG TGT AG C AGGGGCAACT 3 600 

TTAAATGATT ATCCGTATGA GATGGACCGT TTAGAAGAGG TGGCTTTGGA ACTGACTGAA 3 6 60 

ACGGACTATA GCCAGGATGA AACCTTTACG GAATTGCCGT TCTCCCGTCG TTTGCAGGTT 3 720 

CTTTTTGATG AAGCAGAGTA TGTAGCGTCA GTGGTCCATG CTAAGGTACT AGGGACAGAG 3 780 

CACGTCCTCT ATGCGATTTT GCATGATAGC AATGCCTTGG CGACTCGTAT CTTGGAGAGG 3 84 0 

GCTGGTTTTT CTTATGAAGA CAAGAAAGAT CAGGTCAAGA TTGCTGCTCT TCGTCGAAAT 3 90 0 

TTAGAAGAAC GGGCAGGCTG GACTCGTGAA GATCTCAAGG CTTTACGCCA ACGCCATCGT 3 9 60 

ACAGTAGCTG ACAAGCAAAA TTCTATGGCC AATATGATGG GCATGCCGCA GACTCCTAGT 4 02 0 

GGTGGTCTCG AGGATTATAC GCATGATTTG ACAGAGCAAG CGCGTTCTGG CAAGTTAGAA 4080 

CCAGTCATCG GT CGGG AC AA GGAAATCTCA CGTATGATTC AAATCTTGAG CCGGAAGACT 4140 

AAGAACAACC CTGTCTTGGT TGGGGATGCT GGTGTCGGGA AAACAGCTCT GGCGCTTGGT 4200 

CTTGCCCAGC GTATTGCTAG TGGTGACGTG CCTGCGGAAA TGGCTAAGAT GCGCGTGTTA 42 60 

GAACTTGATT TGATGAATGT CGTTGCAGGG ACACGCTTCC GTGGTGACTT TGAAGAACGC 432 0 

ATGAATAATA TCATCAAGGA TATTGAAGAA GATGGCCAAG TCATCCTCTT TATCGATGAA 43 80 

CTCCACACCA TCATGGGTTC TGGTAGCGGG ATTGATTCGA CTCTGGATGC GGCCAATATC 444 0 

TTGAAACCAG CCTTGGCGCG TGGAACTTTG AGAACGGTTG GTGCCACTAC TCAGGAAGAA 4 500 

TATCAAAAAC ATATCGAAAA AGATGCGGCA CTTTCTCGTC GTTTCGCTAA AGTGACGATT 4 560 

GAAGAACCAA GTGTGGCAGA TAGTATGACT ATTTTACAAG GTTTGAAGGC GACTTATGAG 4 62 0 

AAACATCACC GTGTACAAAT CACAGATGAA GCGGTTGAAA CAGCGGTTAA GATGGCTCAT 4 680 
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CCAGTCGTCA 


CTTGCCAGAC 


TCTGCTATCG 


ATCTCTTGGA 


TGAGGCGGCA 


4740 




AAAA1 AAGGC 


TV 7\ TV /*^ "A m y^i m "TV 

AAAGCATGTA 


TV TV 1\ TV y""» TV y— » ^— » 

AAAGCAGACG 


ATTCAGATTT 


GAGTCCAGCT 


4800 




1 GAI GGATGG 


CAAGTGGAAA 


CAGGCAGCCC 


AGCTAATCGC 


AAAAGAAGAG 


4860 


G AAG 1 AC C TG 


T CT AC AAAG A 


CTTGGTGACA 


GAGTCTGATA 


TTTTGACCAC 


CTTGAGTCGC 


4920 


1 1 L» 1 C AGGAA 


TCCCAGTTCA 


AAAAC TG ACT 


CAAACGGATG 


CTAAGAAGTA 


TTTAAATCTT 


4980 


(jAAbOAbAAC 


TCCATAAACG 


GGTTATCGGT 


CAAGATCAAG 


CTGTTTCAAG 


CATTAGCCGT 


5040 


GCCATTCGCC 


GCAACCAGTC 


AGGGATTCGC 


AGTCATAAGC 


GTCCGATTGG 


TTCCTTTATG 


5100 


TTCCTAGGGC 


CTACAGGTGT 


CGGGAAAACT 


GAATTAGCCA 


AGGCTCTGGC 


AGAAGTTCTT 


5160 


TTTGACGACG 


TK -)V ■ 1 1 f—*m A .i" > y— 4 J ■ ■ 

AATCAGCCCT 


TATCCGCTTT 


GATATGAGTG 


AGTATATGGA 


GAAATTTGCA 


5220 


GCTAGTCGTC 


TCAACGGAGC 


TCCTCCAGGC 


TATGTAGGAT 


ATGAAGAAGG 


TGGGGAGTTG 


5280 


ACAGAGAAGG 


TTCGCAATAA 


ACCCTATTCC 


GTTCTCCTCT 


TTGATGAGGT 


AGAGAAGGCC 


5340 


CACCCAGATA 


TCTTTAATGT 


TCTCTTGCAG 


GTTCTGGATG 


ACGGTGTCTT 


G AC AG AT AG C 


5400 


AAGGGACGCA 


AGGTCGATTT 


TTCAAATACC 


ATTATCATTA 


TGACATCGAA 


TCTAGGTGCG 


5460 


AC 1 GCCCTTC 


GTGATGATAA 


GACTGTTGGT 


TTTGGGGCTA 


AGGAT ATT CG 


TTTTGACCAG 


5520 


T\ T\ 7V 7\ m TV m/T /"> 

GAAAATATGG 


AAAAACGCAT 


GTTTGAAGAA 


CTGAAAAAAG 


CTTATAGACC 


GGAATTCATC 


5580 


AACCGTATTG 


ATGAGAAGGT 


GGTCTTCCAT 


AGCCTATCTA 


GT GAT CAT AT 


GCAGGAAGTG 


5640 


GTGAAGATTA 


TGGTCAAGCC 


TTTAGTGGCA 


AGTTTGACTG 


AAAAAGGCAT 


TGACTTGAAA 


5700 


1 1ACAAGCTT 


CAGCTCTGAA 


ATTGTTAGCA 


AATCAAGGAT 


ATGACCCAGA 


GATGGGAGCT 


5760 


CGCCCACTTC 


GCAGAACCCT 


GCAAACAGAA 


GTGGAGGACA 


AGTTGGCAGA 


ACTTCTTCTC 


5820 


AAGGGAGA1 1 


TAGTGGCAGG 


C AG C AC AC TT 


AAGATTGGTG 


TCAAAGCAGG 


CCAGTTAAAA 


5880 


ill G A 1 A 1 1 G 


O 7\ m TV TV TV TV TV TV 

C AT AAAAG AA 


T AAAAGT AT C 


AGCATCTGAC 


CATAAGTCAC 


AGTGGAGTGA 


5940 


AATTCAATGA 


AAATCAAAGA 


GCAAACTAGG 


CAGCTAGCCG 


CAGGTTGCTC 


AAAACACTGG 


6000 


TTTGAGGTTG 


CAGATAGAGC 


TGACGTGGTT 


TGAAG AG AT T 


TTCGAAGAGT 


ATGAAACTAA 


6060 


AACCTATAGC 


TTCTAAACGA 


TCCGTGGTTT 


TCATCATTCA 


ACACAAAATT 


CATATGTTTA 


6120 


TTACCCTCCG 


TCGTATTTGT 


CTT AG AG CGT 


GTGTAGTAGA 


AAAAGAGCAG 


TC TT AT C T G A 


6180 


AATTTTTATT 


CTTTCAAAAG 


AGACCTGTTT 


CTTTTTTGCA 


TGTCAAATCC 


GTTCTAGCTG 


6240 


GTATTTGAAA 


AATCAAACTA 


ATATTCAATG 


AAAATCAAAG 


AACAAACTAG 


GAAGCTAGCC 


6300 


GCAGGTTGCT 


CAAAACACTG 


TTTTGAGGTT 


GTAGATAGAG 


CTGACGTGGT 


T TGAAG AG AT 


6360 


TTTCGAAGAG 


TATAAGCTGC 


AAGATGAATG 


ATTTTCTTGT 


ATTGACGTTG 


TTGTTGACAA 


6420 
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AAAGTAGCGG 


ATAAATGAAA 


TCCATTCCAT 


TATCATAGAT 


GATAGGCTGG 


TAGGAAATTT 


6480 


TCAAATAGCA 


TACAGGAAAT 


AGATGTATGG 


AGTTCTGGTA 


GTAGAAAGGG 


AGAGAGATGA 


6540 


ACATTTTAGT 


TGCAGATGAC 


GAGGAAATGA 


T T AG AG AAGG 


AATTGCAGCA 


TTTCTGACAG 


6600 


AAGAGGGTTA 


TCATGTCATT 


ATGGCTAAGG 


ATGGACAAGA 


GGTCTTGGAA 


AAATTTCAAG 


6660 


ATCTCCCTAT 


CCATCTCATG 


GTACTGGATT 


TAATGATGCC 


TAGGAAGAGT 


GGTTTTGAAG 


6720 


TGTTAAAAGA 


AATCAATCAA 


AAGCACGATA 


TTCCTGTCAT 


CGTCTTGAGT 


GCTCTGGGAG 


6780 


ATGAAACTAC 


TCAGTCACAG 


GTATTTGATC 


TCTATGCTGA 


TGATCATGTG 


ACAAAACCTT 


6840 


TTTCTTTGGT 


ACTGCTTGTC 


AAGCGTATTA 


AGGCGCTTAT 


CAGACGTTAC 


TACGTCATAG 


6900 


AGGATCTTTG 


GCGATATCAG 


GATGTAACAG 


TGGATTTTAC 


CTCTTACAAA 


GC AC AT TATA 


6960 


AAAATGAAGA 


AATTGATCTC 


AAAC C AAAGG 


AATTACTGGT 


ACTAAAGTGT 


TTGATTCAGC 


7020 


AT AAAAAT C A 


AGTTTTAAGT 


AGAGAGCAGA 


TATTGGAAGA 


AATTTCAAAA 


GATGTAGCTG 


7080 


ATTTACCTTG 


TGATAGGGTC 


GTTGATGTCT 


ATATTCGTAC 


TCTTCGCAAA 


AAATTAGCTT 


7140 


TAGATTGTAT 


CGTGACTGTG 


AAAAATGTTG 


GGTATAAGAT 


TAGCTTATGA 


TAAAAAATCC 


7200 


TAAATTATTA 


ACCAAGTCTT 


TTTTAAGAAG 


TTTTGCAATT 


CTAGGTGGTG 


TTGGTCTAGT 


7260 


CATTCATATA 


GCTATTTATT 


TGACCTTTCC 


TTTTTATTAT 


ATTCAACTGG 


AGGGGGAAAA 


7320 


GTTTAATGAG 


AGCGCAAGAG 


TGTTTACGGA 


GTATTTAAAG 


ACTAAGACAT 


CTGATGAAAT 


7380 


TCCAAGCTTA 


CTCCAGTCTT 


ATTCAAAGTC 


CTTGACCATA 


TCTGCTCACC 


TTAAAAGAGA 


7440 


T AT TGT AG AT 


AAGCGGCTCC 


CTCTTGTGCA 


TGACTTGGAT 


AT T AAAG AT G 


GAAAGCTATC 


7500 


AAATTATATC 


GTGATGTTAG 


ATATGTCTGT 


TAGTACAGCA 


GATGGTAAAC 


AGGTAACCGT 


7560 


GCAATTTGTT 


CACGGGGTGG 


ATGTCTACAA 


AGAAGCAAAG 


AATATTTTGC 


TTTTGTATCT 


7620 


CCCATATACA 


TTTTTGGTTA 


CAATTGCTTT 


TTCCTTTGTT 


TTTTCTTATT 


TTTATACTAA 


7680 


ACGCTTGCTC 


AATCCTCTTT 


T TT AC AT T T C 


AGAAGTGACT 


AGTAAAATGC 


AAGATTTGGA 


7740 


TGACAATATT 


CGTTTTGATG 


AAAGTAGGAA 


AGATGAAGTT 


GGTGAAGTTG 


GAAAACAGAT 


7800 


TAATGGTATG 


TATGAGCACT 


TGTTGAAGGT 


TATTTATGAG 


TTGGAAAGTC 


GTAATGAGCA 


7860 


AATTGTAAAA 


TTGCAAAATC 


AAAAGGTTTC 


CTTTGTCCGC 


GGAGCATCAC 


ATGAGTTGAA 


7920 


AACCCCTTTA 


GCCAGTCTTA 


GAATTATCCT 


AGAGAATATG 


CAGCATAATA 


TTGGAGATTA 


7980 


CAAAGATCAT 


C C AAAAT AT A 


TTGCAAAGAG 


TATAAATAAG 


ATTGACCAGA 


TGAGCCACTT 


8040 


ATTAGAAGAA 


GTACTGGAGT 


CTTCTAAATT 


CCAAGAGTGG 


ACAGAGTGTC 


GTGAGACCTT 


8100 


GACTGTTAAG 


CCAGTTTTAG 


TAGATATTTT 


AT C AC GT T AT 


CAAGAATTAG 


CTCATTCAAT 


8160 


AGGTGTTACA 


ATTGAAAATC 


AATTGACAGA 


TGCTACCAGG 


GTCGTCATGA 


GTCTTAGGGC 


8220 
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ATTGGATAAG GTTTTGACAA ACCTGATTAG TAATGCAATT AAATATTCAG ATAAAAATGG 8280 

GCGTGTAATC ATATCCGAGC AAGATGGCTA TCTCTCTATC AAAAATACAT GTGCGCCTCT 8340 

AAGTGACCAA GAACTAGAAC ATTTATTTGA TATATTCTAT C ATT C TC AAA TCGTGACAGA 8400 

TAAGGATGAA AGTTCCGGTT TGGGTCTTTA CATTGTGAAT AATATTTTAG AAAGCTATCA 8460 

AATGGATTAT AGTTTTCTCC CTTATGAACA CGGTATGGAA TTTAAGATTA GCTTGTAGAC 852 0 

AG AT T AGTTT TTTATTAAAG TTCATATAGG GTTAACATAA GTGTGTTATT CTTTGTGTAG 8580 

ATAAAAGAAA GGATACTAAT AT GGT AT TAG CGATTATTTT AGTAACATTC TTTATTCGAT 8640 

TGATTTTTTT AAAGCGTTCG ATAGAGAATG AGAAACGAAT CCTTAGCAAT GGCGGGG 8 697 



(2) INFORMATION FOR SEQ ID NO : 12 4: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 4: 

AACCATACAT ACGGCAAGGC AAAGCTGACG CGGTTTGAAG AGATTTTCGA AGAGTATTAG 60 

TTGCCTTTAA AGGCATCCAC CATCGTTTGA AATTCTTCAT TTGAGAGAGT AATCCCTTTG 120 

CCCATTTTAG TATGGTCTGG ACTCCAAGCA CGAATATCAA ACTTTGCAGG GGCACCATTA 180 

AAGCTCACAC GGTTAATTTC CTTGGTCCAA CCTTTTTCGT TTTCAGAAAG AGTCAACAAG 240 

TGCTCTTCGA TTTCAAATGT AAATTCTGCC ATTTTCTTCT CCTTTTTTAG TTTCATTAGT 3 00 

TTATTCGTAA AATCTTGTAG ATTTTAGGAA AATTTTATAT AATATTGATA TAAAAGAAGG 3 60 

GAGGCCAATA TGAGACATAA ATTCCAGCAA GTTCTAAATA AAATACATGA TTTTTTAAAT 42 0 

GGATATGACC AACCTGACCA GACTGAAACC AACTCCCTTA C AGC C ACT AT TGAAGAGGCT 4 80 

AT C C AGAAAC AAACCGCTGT TCACCTTATC TTGTCTGAGA CAAGCTTTAC AGGTGACATC 540 

ATCAAATATG AT C AG C AAGG CCAGCAAATT ATCGTGAAAA ATTTTTCCAA AAATGTGAGC 600 

CGGATTATCC GTATAAGCGA TATTCAACGC CTGCGATTTG TCCCCTCAAC TGTCCAAACA 6 60 

GCCCAAAAAA ATAGATTTAA GAAAGAGTGA GATGTAGTTG CTTCATCCCA CTCTTTTTTC 72 0 

TTAGCGAATT TGTTCAAAAT GTAAATGAAC TGCGATATGA TCTCCATAAC CACTTCTTTC 780 

CAAGTCACGT TGTAAACGAT AGGAAATGTA GTGTTCTGCA ATGGTAATGT AACCTGCGCC 840 

CAATAAACGA TGTTCAACCA TAGATTGAAT CATACTGATA GTCGCACGTT CCACCTTGGC 9 00 
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TTCTTGTAAA TCCAAAACTA CCTTCTTAGT GACTTGAGCA AGATTTTGAC GCAAATCATC 9 60 

TGTCAAAACA TAAACAGTTT GGGCTGCCTT CAAGATGGCT TGGTAAATCT TATCTGGATT 102 0 

AAATTCAGCA ATTTCGCCAT TACGTTTGAT TACTTGCATA GGTTTCTCCT TTATTCTTTG 1080 

TTTTCTTTGA TTTCTGCCAG CATTTTTTCT TCTTCTACTG TCAGTTGATA ATGTTCAAGT 1140 

AAATCCGGTC TGCGCTCGTA GGTTTTCTTT AAACTCTCGT ACAATCGCCA CTGACGAATC 12 00 

TTTTCATGGT GGCCACTCAT CAATACATCT GGCACGACCA TGCCTCGATA ATCATAGGGA 12 60 

CGTGTGTACT GAGGATATTC TAAAAGACCT GAAGAAAAAC TATCATCTTG GTGGCTAGAC 132 0 

TCCTTGCCAA TCACTTCTGG AATCAGGCGA AC TGT AGC AT CAATCATGGT CATAGCTGCC 13 80 

AATTCTCCAC CAGTGAGGAC ATAGTCACCT AGGGAAATCT CATCTGTTAC CAAGGTCTTA 1440 

ATGCGCTCAT CATAACCCTC ATAGTGCCCA CAGATAAAGA TTAGCTCTTC CTCTTGAGCC 15 00 

AAATCTTCAG CATAAGCCTG ATCAAACTGC TTTCCAGCAG GATCAAGGAG AATAACGCGC 15 60 

GGATTTTTCT TTTCAATAGC ATCAAAGGAA TCGAAAATAG GTTGTGCTCT GAGCAACATG 162 0 

CCCTGACCGC CTCCGTAGGG CTCATCATCT ACATGACGGG CCTTTTCAGC ATTTTCTCGA 16 80 

AAATTATGAT ACTGGATATC CAAGAGCCCT TTTTCTCGAG CCTTTCCAAC GATTGAGTGC 174 0 

TCCAGTGGAG AAAACATCTC TGGAAAGAGG GTTAAAATAT CAATCTTCAT CGTCTAACCC 18 00 

TTCTAAGATT TCCACATCGA CCCGTTTACT TGGAATATCA ACATTGAGAA CCACTGGTGG 1860 

GATATAAGGT AAAAGCAAAT CACGTTTGCC TTTTCGTTTG ACCACCCAGA CATCATTAGC 19 2 0 

ACCTGGTTGC AGGATTTCCT TGATGGTTCC AACCAAGCTA TCACCCTCAT AGACTTCCAA 19 80 

ACCGATAATC TCGTGATAGT AAAATT C AC C ATCGTCTAGG TCATTCAAAT CTTCCTCAGC 2 04 0 

GACCTTGAGA CTGTATCCCT TGTACTTTTC GATAGTATTG ATATGGTACA TATCTTTGAA 2100 

TTTAATAATG TCAAAGTTCT TCTGTTTACG GTGGCTAGCG ATGGTCACTG TTTGGACAAA 2160 

CTGATCTTTT TCATCAAACA AAACCAGCTC AGCTCCTTTT TTAAACCGTT CTTCTGCAAA 2220 

AT C CGTC AC A GACAAGACTC GCATCTCCCC CTGTAATCCC TGCGTATTAA CGATTTTCCC 2 2 80 

AACATTAAAG TAGTTCATCT TGTCTCCTGT AATCTCCTTT TTTCCATCTT ATTCTAACAA 2 34 0 

TTCTCGAATA ATAGCCGCAA TTTTTTCCGA TTCTGACCAT TGTAAATAAT GGTGATTCCC 2 4 00 

TCCTAAAATG AGTTTAGTAT TGGAAGTCCA ATATTCTGAT TCTCTGTACT CTTTTTCTCT 2 4 60 

ATAAGGCTGA CAAAAAACAA ATACAGGAAT ATGAGCTTCT ATAGATACAT CCTCAAAATC 252 0 

TTCCTCAGTA ATCTCTCCAG ATATCTGAAA TTCTGGATCT TGATTTTCCA ACTCTAAGCC 2580 

TTTTTCTTGC ATTAATTCCC AGATTTTTTT ATTCGTTTCA GGACTAAATG TTGCTTGAGT 2640 

TAAGTTCTTA AAATAAAGTT CAGGACCACA CTCGTCAATC AGCCTCATCT GCTCTTCCAT 2 7 00 
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TTCTGGATAA GGATTTTCTG AAAAATCAGC AAACATGACT TTTTTAGTTG TCGGTTCAAT 2 7 60 

TGCTACTAAA GTCTGACGCT TAATTGGTTT CTCGAGTAAT TTGCAAGCTA AAATTCCACT 282 0 

CCAACTATGT GCACAAAGTA TATATTCAGA AATTCCTAAT TCTTCAAGTA CTTCATAAAC 2880 

CGCATCTGCA AGATTATCTA GATTTTTTCC AGCTTGGTCA TGAATCGGAC TCCTACCTGT 2 94 0 

GTTCGGAAAA TCAATTGTCA AATAACCAAT TGTAGGAGGA GGTTTTTCAA GTATAAGTGA 3000 

AAAATTTTCA TAACTTGGTA GCAAACCTGC TCCGTTTAAA CAAACTAGCA CTTTCTTTTG 3060 

CTTTTGATAA GTAACAGAGA GGCTACCAAT TTCTGTAGAT ACTTCAAACC TCTTCATAAA 3120 

GAAATCCACT G AT TC T AT AT AATGAAT T AT TAAAAATCCT TATCCTTTAT TTTATCACGT 3180 

TCCAAGGATT TTCTCAAGTT GGAGGAAGGG GACAATATCT CTACTTTCCC TTCAATAATC 32 40 

CTTCCAAATT ATGTTTATGT TGGTAATTAA TGGCTGCGGT TTTGTCTTTC TCAAAGACAG 3 300 

TCTTGGTAAG GTCAATATGA TTAATAGCTA CGATTGCGAC GGTGTAGTAA AT GAT AT C AG 3 3 60 

CCAGTTCTCT GGCAAGTTCC TCGTTCGAAT CCTATCCCTT CTTTTCGACC AGAGCGCCTA 342 0 

TTCAAAACCT CG AC T ACT T C TCCGACTTCC TCCACTAACT TCATAAAGAG ACCTTCATCA 34 80 

GTCCGAGACT GCTGTTAATG TTCGATTAAG TAGTCTTGGA ATTGCCTAAA CGTTCAATCT 3 54 0 

TTTATAGTAT ATTGAAACTA GAATAGTACA CCTTTACTTC TAAAACATTG T T AG AAATCG 3 600 

ATTTGACTGT CCTGATCGAT TTGTCCTGTT CTTGTTTCAT TTTACTATAT CTTCTATTCC 3 660 

ACACAAAAAA GCGAGACATC CGTCCCGCCC TTCTTATTTT TCGTCAATAA CGATTCTTAC 3 720 

TTTTTTGTAT TCAGTTGGGA CAGAGTAGAC AATCGTTCTT ATCGCAGAAA TAGTGCGACC 37 80 

CTTACGACCG ATTACACGAC CCACATCGCT TTGATCAAGA TTCAAATGAT ATTCCAAAAA 3 84 0 

TTCTGGTGTA TCCTCAATCT TGATAGTTAA GGCATCTGGT TGTGAAATTA AGGGTTTCAC 3 90 0 

AATCGCAATA ATGAGATTTT CAATCGTATC CATCTGTCAA CCTACTTTAA ACTTATTTTG 3 9 60 

AAAAT TT AG A ATCGTGGAAT TTTTTCAATA CGCCTTCTTT TGAAAGGATG TTACGTACTG 402 0 

TGTCTGAAGG TTGAGCTCCA TTAGCCAACC ATGCAAGAAC GCGGTCTTCT TTCAAAGTTA 4 08 0 

CTTGGTTTTC AG C AACAAGT GGGTTGTAAG TTCCAACTGT TTCGATGAAA CGTCCGTCAC 4140 

GTGGTGAACG TGAATCTGCT ACGTTGATAC GGTAGAAAGG TTTTTTCTTA G AACC C AT AC 42 00 

GAGTCAAACG GATTTTAACT GCCATTTTTA AAGTCTCATT TCTTTAATTT TTTATTTCGG 42 60 

TGAAATAGCT GAGCTATTTA GCACATGTTC TATTATAGCA GATTTCTGGC ATGTGTC 4317 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4881 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 5: 

AATTTATTTG ACTGGAAATT GTAGAGGGTT CTCGAAATTT CTTGAATGGT TAAAATAAGG 60 

ACAAGAGAAA ACATGGATAT CTATATCCTT GTGCCAAAAA AACCACTGCC CTCCCCAGAC 12 0 

CAACCTGAGG AAAGCAGTGA TTCTTATTTT AGGAGTTAGG AATGAATACA CGAAATCAAT 180 

TTAGCTGATT ATTTTTTGTT TTTCAAGAAT TCATCGTATT GTTTTTGCAT TTCGTTCAAT 240 

ACTTTTTCGT AGGCACCTTC AGATTTCAAT TTTTCCATCA ATTCTGGAAT CGCTTTATCT 300 

GGGTCTACAG TACCAGTGTT GATAGCTGTA TCAAATTGTT GCATTGTGTT AGCAATAGCT 3 60 

GAGATTTCAG ATTTCACATT GTCAGTATTG AAGATAAATC CAAGCGCTGG AGATTCTTTA 42 0 

GCTTCTGCCA ATTCTTTCTT AGAATTTTCG ATTTGTTGGT CTGTAACGTT TTCGTTGATG 480 

TAAAGGATCC AGTTGTTACC AGTGTTCCAT CCACCCATGT GAGTGTTTCC TTTGTAGCCA 54 0 

TCAAGAACGC GAACACGGTT TTCTTTACCT TCAATTTTTT CCCAGTTCTT GCCTTCTGGA 6 00 

CCGTAAACAA GACCGTTCAA GAGTTCTGGG TTCGTATTCA AGAGGTTCAA GATTTCCATT 6 60 

GATTTTTCTT TGTTCTTAGA GTTGTTTGAG ATGACAAAGT TAGCAACTTG TGTTGTTTGG 720 

TTTTTCTTGA TGAAGTTAGT AATTGGTTTG ATTTGGATAT CTTTGTTGGC AACACGTGAA 780 

AGCAAGCTGT TACCGTAGTC AGCTGGTCCT ACTGTTTCTT CACGAACGAA CCAAGTATCT 84 0 

TGTTGAAGGT CAAAGGAAGT ATCGCTTGTT GCGACGTCTT TTGGAATGTA GCCAGCTTCA 9 00 

TAGAATTTGT GAAGAGTCTT CAAGTGTTCT TTGAAACGAG GCACTTCGTA ACGGTTTACA 9 60 

ACTTTAGTAG TATCGCCTTC AAGGTCGATA ACGAATGGAA GACCGTTTGC TACTGGGTAG 102 0 

T C AAAATT AT CAGATGGGAT GAAAACTTTA CCAATAGCAA ATGGTACTAC GTCTGGAGCT 1080 

TTTTCTTTGA TTTGTTTCAA GACTGGCTCA AGAGTTTCGT AAGAAGTAAC ACCTGAAATA 1140 

TCGATACCAT ATTTAGCAAG GAGAGTTCCG TTGAAGGCAA AGTTTTGAGA TGATGCAACG 12 00 

TTGGCTGCAA CTGGAACAGC GTAAATCTTA C C AT TT AC AG TATTACCCTT GATGTAAGCT 12 60 

GGGTCAAGTG CTTTGTAAAG GTCTTTACCT TCTTTTTTGT ACAATTCTGT CAAGTCAGCG 13 2 0 

TAAGCACCTT TTTGAGCATT TACAATATAG TTATCTGCAA AGGCAATATC ATAGTTTTCA 13 80 

CCAGATGATG TGATAACTGA CATTTTCTTA CCATAGTCAC CCCAGCCAAG GTATTGGATA 1440 

TCCAATTTGG CACCAACTTT TTCTTCAATG ATTTTGTTGG CATTTGCTAA CAATTCATCC 1500 

AAGTTGTCTG GTTTGTCACC GATTTGGTAC ATTTTGATAA CAGGTTTGTC ACCTGAATCA 1560 



WO 98/18931 



PCT/US97/19588 



867 

GCAGCTTTTT TGCTGTTACC TGTCAAATTT CCACAAGCAG CAAGACCTGC AGCCAGAGCG 1620 

ACTACACTAG CAGATGCAAA AGCATATTTT TTCCAGTTTT TCATGATAAA AACTCCTTTT 1680 

TTTATTTTTA AAC T TAT AAA CAATGTAATG ATCTTATACT CAATAAAAAT CAAAGAGCAA 1740 

ACTAGAAAAC TAG C CGC AGG CTGCTCAAAG CACTGCTTTG AGGTTGTAGA TAAGACTGAC 1800 

GAAGTCAGTT ACATATATCT ACGGCAAGGC GACGTTGACG CGGTTTGAAT TTGATTTTCG 1860 

AAGAGTATTA ACTTCACACA AGGGAAGTTG GGAACTGAGA AATGTTATTT CTCAATAAGC 192 0 

ACTATTCTTT CACACCACCG ATAGTCAAAC CTTTTACAAA GTAGCGTTGG AAAAATGGAT 1980 

ACAAAATCGC GATTGGAAGG GTTGCAACCA CAACCATGGC CATACGACCT GTTTCTTTCG 2040 

GTAGAGCAAC TCCCAGTTGA CCAATCAAGC CGACCGCTTT GGCAATGTAG TCCATATTTT 2100 

GTTGGATTTG CATGAGCAAA TATTGCAATG GATACAAGTT GTCACTCTTG ATGTAAAGAA 2160 

GGGCGTTGAA CCAGTCATTC CAGAAACCAA GAGCTGTTAA GAGCGTGATG GTTGCGATAC 2220 

CTGGTAGTGA CAATGGCAAA CAGATTTGGA AGAAAATCCG GGCCTCACTG GCACCATCGA 2 2 80 

TACGAGCCGA TTCTAGAATG GCTTCTGGAA TGGTCTTCTT GAAGAAGGAA CGCATCAAGA 2 340 

TG ATGT T AAA TGGTGAGAGA AGCATTGGAA CAATCAAGGC CCAAACAGTG TCACCAAGCT 2400 

GAAGTACACG GGTCACCATG ATATAACCTG GTACCAAACC AGCGTTGAAC AAC AT AC TG A 2460 

GAAGGACGAA GATGGTAAAG AATCTGCGAT ACTTAAAGGT TGTCCGTGAA ATAGCGTAGG 2 520 

CATAGGTTGT TGTGATAAAG ACATTTGTCA ATGTCCCAAC TACGGTTACA AAGACAGAGA 2 580 

TGAAGAGGGC T T GT AGG ATT TTATCCTTAA ACTGTGCCAA AAACTCAAAA CCGTCTAAGC 2 640 

CAAATTGGGA TGGGAAGAAG CTATAGCCGT ATTGGAGGAG GCTTTTCTCG TCTGTCACTG 2700 

AAATAATGAT AACGAATACA AAAGGTAGGA TACAAGAGAG GGCAATCAAA CCCGAAATGA 2 760 

TACTGAAGAA GATATCTGCT TTCTTACTGA AGGAGTGAAT GCCGACATTA TCAATTTTTT 2 82 0 

CTTTTTTAAT TTTCTTTTTT GCCATATTCT CCTCCTTTCT AGAACAAAGC TGAGTTTGGA 2 8 80 

TCGACTCGTC TTGCAAGCAA GTTTGATAGG ATAACCAGAA TCAAACCAAC AACGGATTGG 2 94 0 

TAAAGACCGG CTGCTGCAGC CATACCGATA TCTGCTGTCT GAGTCAAACC ATTAAAGACA 3 000 

TATACGTCCA AAACGTTGGT TACATTGTAA AG CTG AC C AG CATTGTGTGG GATTTGATAG 3060 

AAGAGACCGA AGTCTGCGCG GAAGATATTT CCGACTGCAA GGATGGTCAA TACAGTTACA 3120 

AGCGGAGTCA ACTGAGGAAT GGTTACGTTG CGAATACGTT GCCACTTGCT AGCTCCGTCC 3180 

ACTGTCGCTG CTTCGTAGTA GGTTGG AT C A ATTCCCATGA TCGTCGCATA GTACATGACA 3240 

C TG CT AT AT C CAAAGCCTTT CCAAATACCT AGGAAAAGTA GGAGATAGGG CCAGATGCCC 3300 
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AGGTCAGCGT 


AG AAAT T G AC 


TTCTTTGAGA 


CCAAGACTTT 


C C AAT AG AT G 


ATTGAACACC 


3360 


CCTTTATCAA 


TATTTAGGAA 


GGCATCTGTA 


AAGAAACTGA 


TGATAACCCA 


AGACAAGAAG 


3420 


TAAGGGAACA 


ACATAGAAGT 


TTGAAAAATC 


TTCACCATTC 


TCTTAGAACG 


GAGCTCGCTG 


3480 


AGGATAATGG 


CAATCCCTAC 


AGATACAACT 


AAACCTAGAA 


AGATAAAGCC 


AAG AT TGT AG 


3540 


AGGACAGTAT 


TTCGTGTGAT 


AATAAAGGCG 


TCTCTTGAAC 


TAAATAAGAA 


TCTAAAATTA 


3600 


TCGAGTCCGA 


CCCATTTACT 


ATTTATGATA 


CT AT C T AT G A 


AACCATTACT 


GGTCATGTGG 


3660 


TAGTCTTTGA 


AGGCAACCAC 


GTTCCCAAAT 


ACTGGAATGT 


AAAAGAATAG 


AATCAACCAG 


3720 


AGTGCCCCTG 


GCAAAACCAT 


CAAGAGAAAG 


ATCCAGTTGT 


CTCTCAATGT 


TTTTGAAAAC 


3780 


TTTTTCATAA 


TTTCCTCCCT 


TTTTATTTTG 


ATATCCATCT 


AAAAATTCTT 


TTTTAGACTT 


3840 


TTGATAACGA 


TT AC ATT ATT 


AGTATACTCC 


TATTTGCAGG 


TTAGGTTAAA 


CTCCTAATTA 


3900 


TAGAAAAAAC 


TCCACAAATT 


ATGTAGCAGA 


TTTAAAACTT 


TATCACCACT 


ATCAAACAAA 


3960 


TGTCCTAAAT 


CAATTGTTTA 


TTTTATCTCT 


AT TAG C C C AG 


TGATGGCGTC 


ACTCTGTTAT 


4020 


AAGCATCCAA 


CAACGGGGTA 


TACTGAAAAA 


TCTCCAGACT 


AGGGAACTCA 


G CG AT AG T T C 


4080 


CTAATCTGGA 


GATTTTTAAT 


ATGTTATTAG 


GCGTTTGCTT 


TCAACTTAGC 


AAT AAC CT C T 


4140 


TTAAGATTAT 


CAATCAACTC 


TGCTGCAGTA 


TGCTCAGAGC 


CTTTTTCATC 


TGCCAAGAAC 


4200 


AAAACTGCTT 


TTTGAAGTTC 


TTTTTGAGAG 


TTTTCAAGGA 


CATCCTTATC 


TACTGTTTCA 


4260 


AGGTTTGAGT 


CTTTAAGAAG 


TTTACTTAAT 


TCCTTGGCTA 


ATTTCTTGAG 


TTTGATTTGC 


4320 


AG AC TC AT CT 


TCTCCTGCTG 


TTTCTTTGCC 


CGCTGTTTGT 


CCTCCATCCT 


TAGTTGCTGA 


4380 


CTGGCTTTCC 


TTAATGGACT 


CTAGGGAAGC 


AATGGCATCT 


TTGACTGTTT 


GC AAG AT AT C 


4440 


ACGTAAACCT 


TGCTCTGTCA 


AACT AT CAT C 


TGCAAAAGCT 


TT AT T AG C C T 


CTGCCAAAAC 


4500 


CAGACGTGCT 


GAATCTGTGG 


TAGGATTCGA 


TACACCTGTC 






a r z. n 


TAAGGTTTGA 


GTCTGCTTAC 


T AAT AC TAG A 


CTAAAATCAA 


AAAG T AT TAT 


ATAACAGTGA 


4620 


TATGAAATCA 


ACTAAAGAAG 


AAATCCAAAC 


CATCAAAACA 


CTTTTAAAAG 


ACTCTCGTAC 


4680 


AGCTAAATAT 


CATAAACGCC 


TTCAAATCGT 


TCTATTTTGT 


CTGATGGGCA 


AATCTTATAA 


4740 


AGAGATTATA 


GAACTTTTAT 


AGTAGTTTGA 


AATAAGATGT 


GAACATCTCT 


ATCAGGAAAG 


4800 


TCAAATTAAT 


TTATAGAAAT 


ATTTTAGCAG 


CCAAGGTGTA 


C TG T TAT AG A 


TTCAATACAC 


4860 


TATACTTGGT 


GGTTTAGCTC 


G 








4881 


(2) INFORMATION FOR SEQ ID NO: 126: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13121 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 6: 

AGGATCCCCG GAAAAGGAGA CTAAAAATGA AGAAAAAATT TCTAGCATTT TTGCTAATTT 6 0 

TATTCCCAAT TTTCTCATTA GGTATTGCCA AAGCAGAAAC GATTAAGATT GTTTCTGATA 120 

CCGCCTATGC ACCTTTTGAG TTTAAAGATT CAGATCAAAC T T AT AAAGG A ATTGATGTTG 180 

ACATTATTAA CAAAGTCGCT GAGATTAAAG GCTGGAACAT TCAGATGTCC TATCCTGGAT 24 0 

TTGACGCAGC AGTCAATGCG GTTCAAGCTG GGCAAGCCGA CGCTATCATG GCAGGGATGA 300 

CAAAGACTAA AGAACGTGAA AAAGTCTTCA CCATGTCTGA TACTTACTAT GATACAAAAG 360 

TTGTCATTGC TACTACAAAG T C AC AC AAAA TTAGCAAGTA CGACCAATTA ACTGGCAAAA 42 0 

CCGTTGGTGT TAAAAACGGA ACTGCCGCTC AACGTTTCCT TGAAACAATC AAAGATAAAT 480 

ACGGCTTTAC TATTAAAACA TTTGACACTG GTGATTTAAT GAACAACAGC TTGAGTGCTG 54 0 

GTGCCATCGA TGCCATGATG GATGACAAAC CTGTTATCGA ATATGCCATT AACCAAGGTC 600 

AAGACCTCCA TATTGAAATG GATGGTGAAG CTGTAGGAAG TTTTGCTTTC GGTGTGAAAA 660 

AAGGAAGTAA ATACGAGCAC CTGGTTACTG AATTTAACCA AGCCTTGTCT GAAATGAAAA 720 

AAGATGGTAG TCTTGATAAA ATTATCAAGA AATGGACTGC TTCATCATCT TCAGCAGTGC 780 

CAACTACAAC TACTCTCGCA GGATTAAAAG CTATTCCTGT TAAGGCTAAA TATATCATTG 84 0 

CCAGCGATTC TTCTTTTGCC CCTTTTGTTT TCCAAAATTC AAGCAACCAA TACACTGGTA 900 

TTGATATGGA ATTGATTAAG GCAATCGCTA AAGACCAAGG TTTTGAAATT GAAATCACCA 9 60 

ACCCTGGTTT TGATGCTGCT ATCAGTGCTG TCCAAGCTGG TCAAGCCGAT GGTATCATCG 1020 

CTGGTATGTC TGT C AC AG AT GCTCGTAAGG CAACTTTTGA CTTCTCAGAA T CAT AC T AC A 108 0 

CTGCTAATAC CATTCTTGGT GTCAAAGAAT CAAGCAATAT TGCTTCTTAT GAAGATCTAA 114 0 

AAGGAAAGAC AGTCGGTGTT AAAAACGGAA CTGCTTCTCA AACCTTCCTA ACAGAAAATC 12 00 

AAAGCAAATA CGGCTACAAA ATCAAAACCT TTGCTGATGG TTCTTCAATG TATGACAGTT 12 6 0 

TAAACACTGG TGCCATTGAT GCCGTTATGG ATGATGAACC TGTTCTCAAA TATTCTATCA 13 2 0 

GCCAAGGTCA AAAATTGAAA ACTCCAATCT CTGGAACTCC AATCGGTGAA ACAGCCTTTG 13 80 

CCGTTAAAAA AGGAGCAAAT CCAGAACTGA TTGAAATGTT CAACAACGGA CTTGCAAACC 144 0 

TTAAAGCAAA CGGTGAATTC CAAAAGATTC TTGACAAATA CCTAGCTAGC GAATCTTCAA 1500 

CTGCTTCAAC AAGTACTGTT GACGAAACAA CGCTCTGGGG CTTGCTTCAA AACAACTACA 15 60 
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AACAACTCCT TAGCGGTCTT GGTATCACTC TTGCTCTAGC TCTTATCTCA TTTGCTATTG 162 0 

CCATTGTCAT CGGAATTATC TTCGGTATGT TTAGCGTTAG C C CAT AC AAA TCTCTTCGCG 16 8 0 

TCATCTCTGA GATTTTCGTT GACGTTATTC GTGGTATTCC ATTGATGATT CTTGCAGCCT 1740 

TCATCTTCTG GGGAATTCCA AACTTCATCG AGTCTATCAC AGGCCAACAA AGCCCAATTA 1800 

ACGACTTTGT AGCTGGAACC ATTGCCCTCT CACTCAATGC GGCTGCTTAT ATCGCTGAAA 18 60 

TCGTTCGTGG TGGTATTCAG GCCGTTCCAG TTGGCCAAAT GGAAGCCAGC CGAAGCTTGG 192 0 

GTATCTCTTA TGGAAAAACC ATGCGTAAGA TTATCTTGCC ACAAGCAACT AAATTGATGT 19 80 

TGCCAAACTT TGTCAACCAA TTCGTTATCG CTCTTAAAGA TACAACTATC GTATCTGCTA 2 040 

TCGGTTTGGT TGAACTCTTC CAAACTGGTA AGATTATCAT TGCTCGTAAC TACCAAAGTT 2100 

TCAAGATGTA TGCAATCCTT GCTATCTTCT ATCTTGTAAT TATCACACTT TTGACTAGAC 2160 

TAGCGAAACG CTTAGAAAAG AGGATTCGTT AATGGCAAAA TTAAAAATTG ATGTAAATGA 222 0 

TTTACACAAG CACTATGGAA AAAATGAAGT CCTAAAAGGA ATTACGACTA AGTTCTATGA 22 80 

AGGAGATGTT GTTTGTATCA TCGGTCCTTC AGGTTCTGGT AAGTCAACTT TCCTCCGTAG 2 34 0 

CCTCAATCTT TTAGAAGAAG TCACTAGCGG TCACATCACT GTGAACGGCT ATGATTTAAC 2 4 00 

TGAAAAAACA ACCAATGTTG ACCACGTCCG TGAAAATATC GGCATGGTAT TCCAACACTT 2460 

CAACCTCTTC CCTCATATGT CTGTATTGGA CAACATCACC TTTGCTCCTA TTGAGCACAA 2 52 0 

GTTGATGACT AAGGAAGAAG CTGAGGAATT GGGAATGGAG TTGCTTGAAA AGGTTGGACT 2 5 80 

AG C AG AT AAA GCTAATGCCA AT CC AG AT AG CCTATCAGGT GGTCAAAAAC AACGTGTGGC 2 64 0 

CATCGCTCGT GGCCTAGCAA TGAATCCAGA CATCATGCTC TTCGATGAAC CAACTTCTGC 2 7 00 

CCTTGACCCT GAGATGGTTG GAGACGTACT TAACGTTATG AAGGAATTGG CTGAGCAAGG 27 60 

CATGACCATG ATTATCGTAA CCCATGAGAT GGGATTTGCT CGTCAGGTTG CCAACCGCGT 2 82 0 

TATCTTTACT G C AG ATGGCG AGTTCCTTGA AGACGGAACA CCTGACCAAA TCTTTGATAA 2 880 

CCCACAACAC CCTCGTCTGA AAGAGTTCTT AGATAAGGTC TTAAACGTCT AAACTCAAAC 2 94 0 

TGTAAGGATT TCCTTGCAGT TTTTCTACCT CGTATTGGAA TTTTTGATTT TTCGGAAAAT 3000 

TATGTTAGAA T T AAGTTT AT GAAATGAGGT TTCCTCATAC CTAGCAAGAC TAGGAATAAA 3 0 60 

AATAGAAATT AGGTAGCTAG ATGTCATCTA AGGTTATTGT TACAATTTTC GGTGCGAGTG 312 0 

GAGACCTGGC TAAACGCAAG CTCTACCCTT CCCTTTTTAG ACTATATCAA TCCGGCAATC 3180 

TTTCCAAGCA CTTTGCCGTT ATTGGAACTG CCCGTAGACC TTGGAGTAAG GAATATTTTG 3 24 0 

AATCTGTAGT TGTCGAGTCC ATCCTTGATT TGGCAGATAG TACCGAGCAA GCCCAAGAAT 3 3 00 

TTGCTAGCCA CTTCTACTAT CAAAGCCATG ATGTCAATGA TTCGGAACAT TATATTGCTT 33 60 
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TGCGTCAATT ACAAGCTGAG CTTAATGAAA AATACCAAGC TGAACACAAT AAGCTCTTCT 3420 

TCTTGTCTAT GGCACCTCAG TTCTTTGGAA CCATTGCCAA ACACCTCAAA TCTGAAAACA 3480 

TTGTCGATGG CAAAGGTTTT GAGCGCTTGA TCGTTGAAAA ACCATTTGGT ACAGATTACG 3 54 0 

CAACTGCAAG CAAGTTGAAT GACGAACTCC TAGCAACATT TGACGAAGAA CAAATTTTCC 3 600 

GTATCGACCA TTATCTTGGT AAGGAAATGA TCCAAAGCAT CTTTGCAGTT CGCTTTGCAA 3 6 60 

ACTTGATTTT TGAAAACGTT TGGAACAAGG ATTTTATCGA CAATGTTCAA ATTACCTTTG 3720 

CGGAGCGCTT GGGTGTAGAA GAACGTGGTG GCTACTATGA CCAATCCGGT GCCCTCCGTG 37 80 

ACATGGTCCA AAACCACACT CTACAACTTC TTTCGCTCCT CGCCATGGAC AAACCAGCAA 3 84 0 

GCTTCACAAA AGACGAGATT CGTGCTGAAA AGATTAAGGT CTTTAAAAAC CTCTATCATC 3 900 

CAACTGATGA AGAACTCAAA GAACACTTTA TCCGTGGGCA ATACCGCTCT GGTAAGATTG 3960 

ATGGCATGAA ATACATCTCT TATCGTAGCG AGCCAAATGT GAATCCAGAA TCAACAACTG 4 02 0 

AAACCTTTAC ATCTGGTGCC TTCTTTGTAG AC AGC G AT C G ATTCCGTGGT GTTCCTTTCT 4 080 

TTTTCCGTAC AGGTAAACGA CTGACTGAAA AAGGAACTCA TGTCAACATC GTCTTTAAAC 414 0 

AAATGGATTC TATCTTTGGA GAACCACTTG CTCCAAATAT TTTGACCATC TATATTCAAC 4200 

CAACAGAAGG CTTCTCTCTT AGCCTAAATG GGAAGCAAGT AGGAGAAGAA TTTAACTTGG 42 60 

CTCCTAACTC ACTTGATTAC CGTACAGATG CGACTGCAAC TGGTGCTTCT CCAGAACCAT 43 2 0 

ACGAAAAATT GATTTATGAT GTCCTAAATA ACAACTCAAC TAACTTTAGC CACTGGGATG 43 80 

AAGTTTGTGC GTCATGGAAG TTGATTGACC GTATTGAAAA GCTCTGGGCT GAAAATGGTG 4440 

CCCCACTTCA TGACTATAAA GCTGGAAGCA TGGGACCTCA AGCCAGCTTT GACCTACTTG 4500 

AAAAATTCGG TGCCAAATGG ACTTGGCAAC CAGATATCAC CTATCGTCAA GATGGTCGCT 456 0 

TAGAATAAAA AAATTTCCTG CAAGTTTATG CcTTGCAGGA TTTTTGCTTC TGATTAGATT 4 62 0 

AAACCTTCCA AG AG AC CT T T CATAAAGTTT TCTGAGTTAA ACTCTCCAAT AT CAT CG ATT 4 6 80 

TTTTCACCAA AACCAATCAA TTTTACAGGA ATATTGAGTT CTTCACGAAT GGCTAGAACC 474 0 

ACACCTCCTC GAGCAGTTCC ATCAATCTTA GTCAAAACAA TTCCCGTTAA AGGTGTGATT 4800 

TTCGAAAATT CTTTGGCCTG T AC TAGGGC A TTTTGACCTG T TG AT G CAT C AAGTGCCAAG 48 60 

AAGGTTTCAT GTGGTGCTTC TGGCACAACA CGTTTGATAA TACGACCAAT CTTTTCCAAC 4 92 0 

TCAGCCATAA GGTTATCCTT ATTTTGCAGA CGACCAGCAG TATCAATCAT GAGAATATCG 4980 

AT AC C TT C AG TCACGGCACG TTCCATACCA TCAAAGACCA CGCTGGCTGG ATCAGCTTTT 5040 

TCAGGTCCAG TTACTACTGG AACATCTACT CGTCGGCCCC ATTCAGCTAG CTGAGCTACT 5100 
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GCACCCGCAC GGAAGGTATC TGCTGCAACC AGCATGACCT TCTTACCAGC TTGTTTGTAG 5160 

CGGTGGGCTA GTTTTCCGAT AGAAGTTGTT TTCCCAACAC CATTCACACC AACAAAGAGC 52 2 0 

ATAACTGTCA AGTTATCTTG GAAGTGGATG CTTTCATCGT AGCTACCATC CTTTTCATAA 52 8 0 

AGCTCAACCA ATTTCTCAAT GAT G AC AC G A CGAAGTACAT CAGGTTTCTT GGCATTTTCA 53 4 0 

AGCTTGGCTT CGTAACGTAG TTCCTCCGTT AAGTTAGAAG CGACTTGGAC ACCAACATCA 5400 

CTCATAATCA GCAGTTCTTC CAGTTCCTCG AAAAATTCTT CGTCAACAGA GCGGAAGTTA 54 60 

GCAAAGAAGG CATTCAAGCG GGCACCGAAA CCTGTGCGAG TTTTCTTAAG ACTGCGGTCA 5520 

TATTTTTCCT GAACAGTTTC TTCTGTTTGA GGAGCTTCTG GTTCAAGCAC TTCAGAATTA 5580 

TTTTCTTCTA CAGTTCCTTC GTGCTCAAGC TTCTCTTCCT CTGGTAATTC TTCTGAGTTT 564 0 

GGTAATTCTT CTATTTCTTC TTGAGAAACC CCTACAGCTG GCTCTGAATC CTGACTTTCT 57 00 

TCAACTGTGT CTTGGATTTC CTCTTCTTGG AACACAGCTT GTTCAACAAT TTCAACCTCT 57 6 0 

GCTTCTTCCT GAGAAACTTC CTCAACTTCT GTGAAGGTAG GATCAACATC TTCAGACAAA 582 0 

TCAAGATTTT CCAGAGCTTC TTTTACAACT TCTTCGATTT TAGGTTCTTC TTTTTTTCCG 5880 

AATAGACGGT CAAACAATCC CATATCTTAG TTCTCCTTTA GC AC AT AT TC TTCGATAGCC 5 940 

CAGGCGACAG CTTCCTCATC GTTGGTCATC GGCGTCACTA CATTTGCGGC TGCCTTTACT 6000 

TCAGGAACAG CGTTTTGCAT AGCAACACCA AGACCTGCCC ATTCAATCAT AGAGAGGTCA 60 60 

TTGGCCTCGT CACCACAAGC CATCACTTGA CTTTGGTCGA TTCCAAGATG GCTGATTAGT 612 0 

TTTGCCAAAC CTGTTGCTTT ATGAACATTC TTTGGTGACC ATTCTAGCAA CATTTCACGT 6180 

GATTTAAAGA TTTCATATTG GTCAAACAAT TCTGGAGAAA TCTTCTGAAT GGCTGCATCC 62 4 0 

AAGGGTTCTT GAGCAAAGGC AGTCACGCAT TTGTTGTAGG TCATTTGACT AGATAAGTCT 63 00 

TCAAAGTCCA CTGGAACAAA GGTCAAAGCT GGATTGAATT TGGCATAAAG ACTTTCTTGG 63 60 

TCCGATTGGA TTTGATAAAC TGTTCCTTCT GAGATGGCAT CAAGAGGCAG TGATAATTTC 642 0 

TCTGTTTCTT CATACAAACG TGCCACATCA TCATATGAAA AGACTGTTTT AT C AAGG AT T 64 8 0 

TCTCCTGTAT TTTTCTGAAC TAATCCACCA TTAAAAGTAA TGGTATACTC ATCTTCCTGA 6 540 

CCGTCAGTCC CTAACTCATG GAGAAAGAAA TCCATGGCTT TTAAGGGACG ACCAGTTGTC 6600 

AATACGACCT TGATACCACG ATCACGCGCA gCTTGCAAGG TTTCCTTGGT ACGATCCGTC 6660 

AGCCTTTTAT CAGTAGTCAG CAAGGTCCCG TCCAAGTCCA ATGCAATCAA TTTTATATCT 6720 

GCCATTATAA GCCCTCCATA TAAGCTATAA CCGACCGTTC CTTATGGTGA CCAATCACAG 67 80 

TCTTTGCTAA TTCTAAAATT TCAGGTCGTG CATTTTCAGG AGCTACAGGA TGTCCCACAA 6 84 0 

CCTGCATCAT ATGTAAGTCA TTAAGATTGT CTCCAAAAGC CATGACCTGA TCCATTGTGA 69 00 
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TACCAAGTTT TTTAACTAAT TCAACAATGG CCACTCCCTT AT CG AC AT AG TCCAGAACAA 6960 

TAT C AATGG A TTCAAAGCCA GTTGTCATGG CCTTAACACC AGG AACGTTT TCGTTTACCC 702 0 

AAGCCTCCCC ATCTTCCAGC GTTTCTTCTG TGAAGTTGGT TGTAAATTTG AAAATGTCAT 7 080 

CTGTGATATC TTCCAAACTC GCTACTTTTT GGATATTTTC ATTATAGTGC TGACTCACTT 7140 

TCAAATAGGT CTCATCAACC GTATCTAGAA CATATGAACC CTTCTTACCC GTCAAGAGCA 72 00 

GTTTATTGAT AT C T AC AT AA GGTGAAGTTT TCAGCTTTTC AAAAGTTGCC AGATAAAAGT 72 60 

CACGAGACAT AGTCGCTTCA TACAAGTCCT GACCTTGATA CTCTACCAAA CTGCCATTTT 732 0 

CCGCGATGAA AATAATGTCA TCACGAACAC CAGCAAATAA TTTTTCTAGA G AC AG AAAT C 73 80 

CCCGACCCGA AGCTACCGCA AAGTAAATCC CTTTTTCCTT GTAGGAAACC AAG AG AG AC T 7440 

TGAGACGATC CATATCAAAG CGTCCATTCC CATCTAGGAA GGTTCCGTCC ATATCCGTTG 7 500 

CTACTAGTTT AATTGTCATC CTTCAATACT TTCTAAATCT TTTAACTTAA CTGAAACAAT 7 56 0 

CTTTGAAACA CCCGATTCTT GCATGGTCAC T C CAT AG AT G GAATCAGCCG CTGCCATGGT 7 620 

TCCCTTACGG TGGGTTACGA CGATGAACTG GCTGTCCTTG TCAAAGCGGT TGAGGTAATC 7 6 80 

CCCAAAACGT TTAACATTGG CTTCATCCAG CGCAGCTTCC ACCTCATCCA AGATAACAAA 7740 

TGGAATAGTC TTGACACGAA TAATGGAGAA GAGCAAGGCA AGAGCCGATA GGGCTTTTTC 7 800 

ACCACCACTC ATGAGATTAA GAG ACT GG AT TTTCTTGCCT GGTGGTTGGA CAGAAATTTC 7 8 60 

AACCCCAGCT GTCAGCAAGT CTCCTTCAGT CAAAATGAGG TCAGCCTGAC CTCCACCAAA 7 92 0 

CATCTGCTTG AAGGTCACTT TAAAGGACTC ACGAATGACC TCAAAGGTTG AT T T AAAGCG 7980 

TTCCTTGACC TCATCATTCA TCTCTGTAAT GGTCTCAAGG AGCAGGTTTT TCGCAGACAA 804 0 

AAT AT C AT C A CGTTGGCTAT TT AGG AAAT C CAGACGGTTG TGAACTTCTT CGTACTGTTC 8100 

AATAGCGTCT AAATTGACAG GACCCAGTGA GCGTATAGCC TTCTCTAAAT CCTTAACTTC 8160 

TTGCTCTGCC AGATTGAGAT TTTCCAACTC ATGCGCCTTT TCTAAAGCTT CTGTGTAGCT 822 0 

GATCTGGTAC TGGTCTGTTA ATTGACTTTG TAGATGGCGC AAGCGCTCGC TAACCTTTTC 8280 

TTTCTTGGCT TCAGCACGAG TTTGCTTGCG AATCCACTCT TCATTCTGCT GGCGAGCCTG 8340 

AT C C AAAT G A CTAGCAATAT CATCCAGTTG ACCCTCAATA TCATCCAACT CAAACTGCTT 84 00 

GCGAATCAAA CCTTGTTGGA GATTTGTTTT TTGAGTTTTG GATTCTTCCG CCTGTTGACT 8460 

GAGCAATTCT GTATCAACCT TCTCAAGATT ATCAATCTTT TCTTGAAGAA GGCGCTGGAT 852 0 

TTCCTCTTGT TCAAAATCAA GATTGTCCAA TTCCTTGCCT AAGCGTTCAA TATCAGCAAC 8580 

TTCATAACGT TTTTGCCCTT GCAGTTCTGT CTT AAG C AAA CGAGCTTGCG CTAGCTCTTC 8 640 
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CTGCAAGTTT TGATAGCGTT CTTGGATGGC ATTTTTGTTA GACTTAATCT CTTCAATCTC 8700 

AGCTTCCAGA TTTTGCTTGT CACTGGAGAT TGCAGCAAGA CGCTCTTGGC AGTTTTCCTT 87 60 

ATCCGCTTGC CAATCTCCCT CGGAAAGACG ATCTATTTCC TCTTCTTGGA GTTTCCAAAG 882 0 

AGTTTCCAGT TCTTCAACTT GCTGACTAGT TTGCTGATAA GCGAGGAACA AGCCTTGCTC 888 0 

CTGAATACGT GCCTGCTCTC CTTGAGATTT AATAGCTTCT AATGACTCGG TCAATCTGGC 894 0 

CATCTCATCT TGCAAGGTCT TCAAAGTCGC CTCTTCTGAA CCCAAGCTTG CTTCTTCTTC 9000 

AGCAATTTCT TTTTGTAATT GCTCCAGTTC TGGCTTGATA AAAATGCTGT TATTCTGGCG 90 60 

ATTGGCACCA CCTGCATAAG AACCACCTGT GCGCAACTCT GTCCCATCCA ATGTCACCAT 9120 

ACGAACCTGA TAACGAACTT GGCGAGCTGC TGCACGCGCA TGTTCTACGG TATCAAAGAT 9180 

AGCCGTCGTA GCTAGCAAGT TCTTGAAAAT GGCTTCCAGT CTAGTATCAA AAGTCACCAA 924 0 

CTCATCTGCC ATCCCAAGGA AACCTGGGCT TACAGCGATA GCATCTTGGT TCTGACTAGA 9300 

AATCGTACGC GCCTTGATAG TGGTCAAAGG AAGAAAGGTT GCACGACCGG CTCTGTTCCG 9360 

TTTAAGGAAG TCAATAGCCT TGGTTGCCGA CTCTTCATCT TCTACGATGA TATGCTGGCT 9420 

ACTTGCCCCT AAGGCAATCT CTAGGGCAGT TTGATAATAA ACATCAAAGG TCAGATGCTC 94 80 

ACTGACTGCA CCAATAATCC CACCTAGGCG ATCTTTTTCT TGGAGAACAC TCTTAACACC 9 540 

TGCATAAAAG TTACTATGAT TTCTCAGGAT ATTTTCCAAA CTTTGAGCTC TGGCCTGCTT 9600 

GTTTTTGAGA TTATCCAGAC GGTCAAAGAG TTGGCTTTGT TGAGCTTGAT AGGAAGTTTT 9660 

CTGCTCCTCT TGCTCCTTGG CAATAGCTTG GTAGTCAGCC AATAATTTCT GAACCTGCTC 9720 

CTTGGCAGTT TCAAGCTCTT CCTTTTGCTG ACTAGCCTTC TCTTTAGCTA TAGCTAATTG 9780 

CTCTTTCAGC TTTTCTAGTT GATCTGCTTG TTTTTGAGAA AGCTGACGAC TATTTTCCAA 984 0 

CTCATTCTCA AT ACGGGT C A ACTGGTTTGA GACATCCGCT TCTTCTTGTA AAAG AG C T AC 9900 

AAAGCGTTCA CGTAAGAGCT CAATCATCTG ATCAGGATCG TCTGAGAAAG CCAGCAATTC 99 60 

AGCTTCTAAA CGATTGAGTT TTTGATTATT TTGGACTAGA TTTCCCTCTA ACAGAGCTAA 10020 

AGAGCTTTCT TTATCAGACT TTTCTTTGCT GAGTGAATTT CTCTTATCCT CCAAAGCAGC 1008 0 

CAAACGGGCT TGTGCCTCCT GTTGATTCAA GGCCACTTGC TCGGACTCCA GTTTCGATAG 1014 0 

GGCTAATTTT CTTTCTAAAT CACTAATCAG ACTAGTCAAG TCCATCAAAC TGCCTTGGTC 10200 

TTTGGCCATT TCAGCCTGTA AATCTTGGCG TTGCTTTTTA AGAGTTTGAT TTTCTTCTTC 10260 

TAATTTTTCA CGCTTTTGGT AATAACTCAT CAAGAGTTCT TGAACCTGAG TCAACTCTTC 103 2 0 

TTCTGTCGAC TCTAGTTCAG CCTTATTTTC CTTGATTTGA GCAACCAGAA CATCTAAATA 10380 

AATAGCCTTA CGTTGTCCTT CCAAGTCTAA AAACTTACGG GCATTCTCAG CTTGCTTCTC 10440 
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AAGAGGCTTG ATTTGATTAT CCAACTCGTA GATAATGTCC TCTAAGCGGT C C AG ATT AT C 10500 

CTGAGTTTGC TGCAGTTTAC TCTCGGTTTC TTTTCTGCGA GTCTTGTATT TTAAAACTCC 105 60 

AGCAGCTTCT T C AAAAAT AG CTCGTCGTTC CTCAGGCTTG GAATTAAAAA TCTCCTCAAC 10620 

CTTCCCTTGG GAAATAATAG AGAAGGAATC TCGTCCCAAT CCAGTATCCA AGAAGAGGTC 10680 

ATGAATATCA CGCAGACGGA CTTTCTTGCC GTCAATCTTG TATTCGCTAT CTCCACTACG 10740 

ATAGACATGG CGTTCCACCC TGATTTCTTG ACCTGCATCC TTGATAAATC CGTCATGATT 10800 

ATCCAGAGTC AC AAC TAC AG AAGCATAATT GAGCGGTTTG CGACTTTCGG TTCCAGCAAA 108 60 

GATGATATCC GGCATCTTGC CCCCACGGAG ACTCTTGACA CTAGACTCCC CCAAAGCCCA 1092 0 

ACGCAGACTT TCTGTAATAT TGGACTTTCC AG AT C CAT TG GGTCCAACAA CTGCCGTCAC 109 80 

ACCTTGGTCA AAAACG AC CT TGGTCTTATC AGCAAAAGAC TTGAACCCCT GAATTTCGAT 11040 

TTCCTTTAAA TACATGAATC CAGCCCCTTC TCAACGGCAT TTTTGGCAGC TTCCTGCTCT 11100 

GCTAATTTCT T AG AACG AC C TTGGCCTTGA CCGATGCTCT TACCTTCAAC AAGAACTTCT 11160 

ACATCAAAAA CCTTATCGTG AGCAGGCCCT GTTTCAGAAA TCACCTGATA ACGAATAGCC 112 2 0 

ACATCACCAT TGACCTGAAG CAACTCTTGG AGATGGGTTT TATAGTCTGT AATCATCTCA 112 80 

AACTCGCCTG CTTCAACCTT AGGAATCATG ACTTGATAGA TAAATTCCTT GACCTTGGCC 1134 0 

ACATCCTTAT CCAAAAGAAG GGCACCAAGA AAGGCTTCAA AGGCATCACC AAGAATGGTG 114 00 

TCACGATTGC G AC C AC CTG A TTTTTCTTCC CCTTTACCCA ACTTGATAAA CTGGTCAAAC 114 60 

TGGCAATCAC GCGCAAAACC AGCTAAACTC TCCTCACGGA CAATCATAGC ACGGAGTTTT 11520 

GATAGGTCAC CTTCAGGCTT TT TAGG AT AT TTTTTATATA GATATTCTGA AATCAATAAC 11580 

TGTAGAACAG CGTCTCCTAA AAATTCCAAG CGTTCATTGT GTGAAATTTT TAAGAGGCGG 1164 0 

TGCTCATTGG CATAACTCGT ATGAGTAAAG GCAGTTTCCA GTAACTTTTT GTCTGCAAAT 117 00 

TCGATTGCAA AATGATTCTT TAGTACAGTT TGTAATTCTT TCATACCAAC CTCTTTCTAA 117 6 0 

C TG AT AATAG TCCTTTTTAT TATATCAAAA AAAGCCCCCT. GAGTCACTCT AAAACGGGAC 1182 0 

TGGAAAGCAT TTGGGAATTC TTTAGACAGA GATTCTCAGT TTTAGCGGCA AATTTGGGTC 118 80 

AGGATAAAGA AAAAAGCCCT ATTAAAGGCT TTTTAGGATG TTTACATCCA CCCTGAGGGA 11940 

ATCGAACCCC CATCTCAAGA ACCGGAATCT TACGTGATAT CC ATT AC ACT AAGGGTGGAA 12000 

ACTTGTTTTA TTATAACAGA AATTTGCTCT AATAACAAGT TTTTTGGTCA AAGACCCCGT 12 0 60 

CTTAGTGGGA AGCATCCCCA TTCCAGATGG AGTTTTTCAC GATCACATAA TCAACGTGTT 1212 0 

TAAGGTCAGC AACCTGACGT CCACCTGCAT AAGAAATAGC ACTTTGAAGG TCTTGTTCCA 12180 
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TCTCAGTTAA AGTGTCTTGC AGATGACCTT TAGCAGGAAG CAAGATACGT TTGCCTTCCA 12 240 

CATTTTTGTA AGCACCTTTT TGATATTGTG AGGCTGAACC ATAATATTCT TTGAACTGTT 12 300 

CACCATCGAC TTCAATCGTT TTCCCTGGAC TTTCAATGTG TCCTGCAAAG AGGGAACCAA 12 3 60 

TCATGATCAT GCTAGCACCG AAGCGGATAG ACTTAGCAAT ATCACCGTGA GTACGAATTC 12 42 0 

CTCCATCAGC GATAATCGGT TTACGCGCAG CCTTGGCACA CCAGCGTAGA GCAGCCAACT 12 4 80 

GCCAACCACC TGTACCAAAA CCAGTCTTAA CCTTGGTGAT ACAAACCTTA CCAGGACCGA 12 54 0 

TTCCGACCTT AGTAGCATCC GCACCAGCAT TTTCCAATTC ACGCACAGCT TCTGGTGTTC 12 600 

CCACATTTCC AG C AATG AC A AAGGTATCTG GCAATTCTTT CTTGATGTGT TGAATCATAG 12 660 

AAATCACGCT ATCCGCATGA CCATGAGCAA TATCAATAGT GATATACTCA GG AG T AT C AG 12 72 0 

CCTTGAGCTG GCTAACAAAA T CAT ACT CAT AATCCTTAAC ACCGACAGAG ATAGAAGCAA 12780 

TGAGCCCTTG ATTGTGCATT CGTTTAATAA AAGGAATGCG TCCTGCCTCA TCAAAACGGT 12 840 

GCATAATGTA GAAGTAACCA CCTTTAGCCA GTTGCTCTGC TACATTTTCA TCCAAAATCG 12 9 00 

TCTGCATATT CGCTGGCACA ACAGGTAGTT TAAAGGTGTG ATTTCCTAAA GTGACACTTG 12 960 

TATCCGCTTC TGCACGGCTT T T AATG AC AC ATTTATTTGG AATCAATTGA ATATCTTCGT 13 020 

AATCAAAAAT TGGAAATTCA TTTAACATAT CGATGTCTCG TTTCTTTTGT AATG AC C T AC 13080 

CTATGCTCTT GCATCACTAC GCCTTTTCCG ACGTTTCCTG G 13121 
(2) INFORMATION FOR SEQ ID NO : 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 7: 
CCGAATGCAA TGTTTACGGT TGAACTTGAA AATGGACATC AG AT T TT AG C AACAGTTTCT 60 

GGTAAAATTC GTAAAAACTA TATTCGTATT TTAGCGGGAG ATCGTGTTAC TGTCGAAATG 12 0 

AGTCCATATG ACTTGACACG TGGACGTATC ACTTACCGCT TTAAATAATC GAAAAACTTG 180 

GAGGGATAAG AAATGAAAGT AAGACCATCG GTCAAACCAA TTTGCGAATA CTGTAAAGTT 2 40 

ATTCGTCGTA ATGGTCGTGT TATGGTAATT TGCCCAGCAA ATCCAAAACA CAAACAACGT 300 

CAAGGATAAG ATAGAAAGGA GAAAACATGG CTCGTATTGC TGGAGTTGAT ATTCCAAATG 3 60 

ACAAACGCGT AGTAATCTCA TTGACTTATG TTTATGGTAT CGGACTTGCA ACATCTAAGA 42 0 

AAATTTTGGC TGCTGCTGGA ATCTCAGAAG ATGTTCGTGT ACGTGATCTT ACATCAGATC 480 
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AAGAAGATGC TATCCGTCGT GAAGTGGATG CAATCAAAGT TGAAGGTGAC CTTCGTCGTG 54 0 

AAGTAAACTT GAACATCAAA CGTTTGATGG AAATCGGTTC ATACCGTGGT ATCCGTCACC 600 

GTCGTGGACT TCCTGTCCGT GGACAAAACA CTAAAAACAA CGCCCGCACT CGTAAAGGTA 660 

AAGCTGTTGC GATTGCTGGT AAGAAAAAAT AATATAGGAG GTAAAAGTCT TGGCTAAACC 720 

AAC AC GT AAA CGTCGTGTGA AAAAGAATAT CGAATCTGGT ATTGCTCATA TTCACGCTAC 780 

ATTTAATAAC ACTATTGTTA TGATTACTGA TGTGCATGGT AATGCAATTG CTTGGTCATC 84 0 

AGCTGGTGCT CTTGGTTTCA AAGGTTCTCG TAAATCTACA CCATTCGCTG CTCAAATGGC 900 

TTCTGAAGCT GCTGCTAAAT CTGCACAAGA ACACGGTCTT AAATCAGTTG AAGTTACTGT 960 

AAAAGGTCCA GGTTCTGGTC GTGAGTCAGC TATTCGTGCG CTTGCTGCCG CTGGTCTTGA 1020 

AGTAACAGCA ATTCGTGATG TGACTCCAGT GCCACACAAT GGTGCTCGTC CTCCAAAACG 1080 

TCGCCGTGTA TAATCATCGC AT T AC AC TGC TTTTCGTTTA AGAGGGAGTA ACTAAATGAT 114 0 

CGAGTTTGAA AAACCAAATA TAACAAAAAT TGATGAAAAT AAAGATTATG GCAAGTTTGT 12 00 

AATCGAACCA CTTGAACGTG GCTACGGTAC AACTCTTGGT AACTCTCTTC GTCGTGTACT 12 6 0 

TCTAGCTTCT CTACCAGGAG CAGCTGTGAC ATCTATCAAC ATTGATGGTG TGTTACATGA 13 2 0 

GTTTGACACA GTTCCAGGTG TTCGTGAAGA CGTGATGCAA ATCATTCTGA ACATTAAAGG 13 8 0 

AATTGCAGTG AAATCGTACG TTGAAGACGA AAAAAT C AT C GAACTGGATG TTGAAGGTCC 14 40 

TGCTGAAGTA ACAGCTGGTG ACATTTTGAC AG AT AG CG AT ATTGAAATTG TAAATCCAGA 1500 

TCATTATCTC TTTACAATCG GTGAAGGTTC TTCTCTAAAA GCGACTATGA CTGTTAACAG 15 60 

TGGTCGTGGA TATGTACCTG CTGATGAAAA TAAAAAGGAT AATGCACCAG TTGGAACACT 162 0 

TGCTGTAGAT T CT AT T TATA CACCAGTTAC AAAAGTCAAC TATCAAGTGG AACCTGCTCG 1680 

TGTAGGTAGC AATGATGGTT T CG AC AAATT AACCCTTGAA ATCTTGACAA ATGGAACAAT 174 0 

T ATT C C AG AA GATGCTTTAG GGCTTTCAGC ACGTATTTTG AC AG AAC AT C TTGATTTGTT 1800 

TACAAATCTT ACTGAGATTG CTAAGTCAAC TGAAGTGATG AAAGAAGCTG ATACTGAATC 1860 

TGACGACCGT AT T TT AG AT C GTACGATTGA GGAACTGGAC TTGTCTGTGC GTTCATACAA 1920 

CTGTTTAAAA CGTGCCGGTA TCAATACTGT GCATGATTTG ACAGAAAAAT CTGAAGCAGA 19 8 0 

GATGATGAAA GTACGAAATC TTGGACGCAA GAGTTTGGAA GAAGTGAAAC TCAAACTCAT 2 040 

TGATTTGGGT CTTGG AT T AA AAGATAAATA AAGGAGGAAT ACATGGCTTA CCGTAAACTA 2100 

GGACGCACTA GCTCACAACG TAAAGCAATG CTTCGCGATT TGACAACTGA CCTTTTGATC 2160 

AACGAATCAA TCGTGACAAC TGAAGCTCGT GCTAAAGAAA TCCGTAAAAC TGTTGAAAAA 2220 



WO 98/18931 



PCTYUS97/19588 



878 

ATGATTACTC TAGGTAAACG TGGTGATTTG CATGCACGTC GTCAAGCAGC TGCTTTCGTA 2 2 80 

CGTAATGAAA TCGCATCTGA AAAC T ATGAT GAAGCAACTG ATAAGTACAC TTCTACTACA 2 34 0 

GC AC T TC AAA AATTGTTCTC AGAAATCGCA CCTCGTTATG CTGAACGTAA CGGTGGATAC 24 00 

ACTCGTATCC TTAAAACTGA ATCACGTCGT GGTGATGCAG CGCCAATGGC GATCATCGAA 24 60 

TTAGTATAAA ATCATCAATT TTGTTGAGTG TTATGATGAT GGAGTCTTGT GCTCTTAGTC 252 0 

TAGCTCTGGT CTACCGCTAG GATTTCGGTC CTAGCGGGAA CACTCATCAT AAGTTGGGAT 2580 

AGTAGACGCT TGTTTACGAA ATTGTTTTTT TCTTAAGAAC AACTTCGTAA GCAGGCGTTT 2 64 0 

TTGAGTATTT TCGTTAGAAT TATGCTATAC TATTTGAAAA GAATCCTGTT TAATGTTAAG 27 00 

GTTTCTTATT TTAAGAAGAA TTGGAGTTTA CTTATGAAAG CCATTATAAC TGTTGTTGGT 2 7 60 

AAAGATAAAT CTGGAATTGT TGCAGGTGTT TCTGGTAAAA TTGCAGAATT AGGATTGAAT 2820 

ATTGACGATA TCTCTCAAAC TGTCTTGGAT GAATATTTTA CGATGATGGC TGTTGTATCT 2880 

AGTGATGAAA AGCAAGATTT TACCTATCTT CGTAATGAAT TTGAAGCTTT TGGGCAAACT 294 0 

TTGAATGTAA AAATCAATAT T C AG AGTGC A GCGATTTTCG AAGCTATGTA T AAT AT C TAG 3 000 

GAGGTCATCA TGGATATTAG ACAAGTTACT GAAACCATCG C CAT GAT T G A GGAGCAAAAC 3060 

TTCGATATTA GAACCATTAC CATGGGGATT TCTCTTTTGG ACTGTATCGA T C C AG AT AT C 312 0 

AATCGTGCTG CGGAGAAAAT CTATCAAAAA ATT AC G AC AA AGGCGGCTAA TTTAGTAGCT 3180 

GTTGGTGATG AAATTGCGGC TGAGTTGGGA ATTCCTATCG TTAATAAGCG TGTATCGGTG 3240 

ACACCTATTT CTCTGATTGG GGCAGCGACA GATGCGACGG ACTACGTGGT TCTGGCAAAA 3 3 00 

GCGCTTGATA AGGCTGCGAA AGAGATTGGT GTGGACTTTA TTGGTGGTTT TTCTGCCTTA 33 60 

GTACAAAAAG GTTATCAAAA GGGAGATGAG ATTCTCATCA ATTCCATTCC TCGCGCTTTG 342 0 

GCTGAGACGG ATAAGGTCTG CTCGTCAGTC AATATCGGCT CAACCAAGTC TGGTATTAAT 3 4 80 

ATGACGGCTG TGG C AG AT AT GGGACGAATT ATCAAGGAAA CAGCAAATCT TTCAGATATG 354 0 

GGAGTGGCCA AGTTGGTTGT ATTCGCTAAT GCTGTTGAGG ACAATCCATT TATGGCGGGT 3 600 

GCCTTTCATG GTGTTGGGGA AGCAGATGTT ATCATCAATG TCGGAGTTTC TGGTCCTGGT 3 6 60 

GTTGTGAAAC GTGCTTTGGA AAAAGTTCGT GGACAGAGCT TTGATGTAGT AGCCGAAACA 3720 

GTTAAGAAAA CTGCCTTTAA AATCACTCGT ATCGGTCAAT TGGTTGGTCA AATGGCCAGT 3 7 80 

GAGAGACTGG GTGTGGAGTT TGGTATTGTG GACTTGAGTT TGGCACCAAC CCCTGCGGTT 3 840 

GG AG AC T C TG TGGCACGTGT CCTTGAGGAA ATGGGGCTAG AAACAGTTGG CACGCATGGA 3900 

ACGACGGCTG CCTTGGCCCT CTTGAACGAC CAAGTTAAAA AGGGTGGAGT GATGGCCTGC 3 9 60 

AACCAAGTCG GTGGTTTATC TGGTGCCTTT ATCCCTGTTT CTGAGGATGA AGGAATGATT 4020 
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GCTGCAGTGC AAAATGGCTC TCTTAATTTA GAAAAACTAG AAGCTATGAC GGCTATCTGT 40 8 0 

TCTGTTGGAT TGGATATGAT TGCCATCCCA GAAGATACGC CTGCTGAAAC TATTGCGGCT 4140 

ATGATTGCGG ATGAAGCAGC AATCGGTGTT ATCAACATGA AAACAACAGC TGTTCGTATC 4200 

ATTCCCAAAG GAAAAGAAGG CGATATGATT GAGTTTGGTG GTCTATTAGG AACTGCACCC 42 60 

GTTATGAAGG TTAATGGGGC TTCGTCTGTC GACTTCATCT CTCGCGGTGG ACAAATCCCA 4320 

GCACCAATTC ATAGTTTTAA AAATTAAGAA AATAGGAGAA ATTTTAAGTT CTATTTAAGA 43 80 

TTAGACGTGT ATACTATAAT CATTAAATAA AGACCTCCTA ATATTATTTG AAACAGATAA 4440 

CACTGAATTA GTTTGAATTT GATTTTCATC TAATATCTTT ATTTAATGAA CTCCTAAACT 4500 

TTTTCATAAT AATCTCCTTC AAAAGTCGCC TGTATGGGTG GCTTTTATTT TATCATTCAT 4 5 60 

GATATAATAG AAGCAAACGG AGGACGGAAA ATGGTAAAAG TACGATTGTA TTTGGTACGT 4 620 

CAT GGC AAGA CCATGTTTAA C AC G ATTGGT CGCGCGCAAG GTTGGAGCGA TACTCCCTTA 4 680 

ACTGCTGAAG GTGAACGAGG GATTCAAGAG TTAGGAATCG GTTTGCGAGA ATCTGATCTA 474 0 

CAGTTTGAGC GTGCTTATTC GAGTGATTCT GGTCGTACCA T T C AG ACC AT GGGAATTATC 4 800 

CTTGAAGAAC TTGGCTTGCA GGGGGAAATC CCTTATCGCA TGGACAAGCG TATCAGAGAA 4860 

TGGTGTTTCG GTAGTTTTGA TGGAGCCTAT GATGGCGATC TTTTCATGGG CATTATTCCT 4 92 0 

CGTATCTTTA ATGTGGACCA CGTTCACCAA TTGTCTTATG CTGAACTGGC TGAGGGCTTG 4 980 

GTAGAGGTCG AT AC AGCT GG TTGGGCTGAA GGCTGGGAAA AACTCAGTGG CCGAATCAAG 5040 

GAAGGCTTTG AAATGATTGC AAAAGAAATG GAAGATCAAG GTGGAGGTAA CGCCCTTGTT 5100 

GTCAGCCATG G AATG AC T AT T GG AAC C AT T GTTTATCTGA TTAATGGCAT GCATCCGCAT 5160 

GGTCTGGATA ATGGTAGCGT GACAATCCTT GAATATGAGG ACGGCCAGTT TAGGGTTGAA 522 0 

GTTGTCGGTG ACCGTAGTTA CCGAGAGCTA GGACGTGAGA AGATGGAAGA AGGCTCTATT 52 80 

TAATCAGTCT AGACTTGCTT GCCATGAGCT AGGGATTTGA TAAGAATATC AAGATAAGAA 5340 

AAAAC AG C CG AGGGCACTCC TTTCGGCTGT TTTTGATGTG GAAAACTAAA GTGTAATGCT 5400 

ATTGCTTTTA GAGATTTTCA TAAACAAGAG CAAGGAACCT ACTGTTAGAA CAGTCAGGAT 54 6 0 

AGTTGACAAG GTTGCGGCTA CACCGTAATT TCCTCTGAGA ACCTCTGTAT AAATAGCTAC 552 0 

AGTCATTGTT CTTGTTTTGA CATTGTAGAG GAGGATAGAA GTAGAGAGTT TTGAAATCAT 5580 

TGTGACTCAA GATAAGATGG CTCCAGAAAT GATACCAGAT AGCATCATTG GAGTTGTAAT 5640 

CTTAGCAAAG GTATTGAGAC G ACT ACTT CC TAAGCTTTCA GCAGCTTCTT CAATACTTGG 57 00 

TGCTATTTGT TGTAAGCTAG CAACAGATGA GCGAATAGTA TAAGGTAATC TTCTGGCAGA 57 60 
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TAGAGACATA AT C AAG AT G A AAGCAGTCCC TGTAATCATA AGAAATCCAC TTCCAAATAG 582 0 

ACCAGTATTG AAGGAAGAAA TGAAGGCAAT CCCTAGAACG GTTCCTGGTA CAATATAAGG 58 8 0 

TACCATACTG AGGCTGTCAA TTAAGTTTGT AAACAAATTC CGTTTTCTAA CGGCTAGGTA 594 0 

GGAGATAAAT GTCGCAAATA G AAC AAC TAG AACTAAGGCA ATCAAAGGGA TACGAATGGT 6000 

ATTGAAAATA GCAGATCCCA TACGATGGAA AGCTACCTTG TAACTGTTTG GAGAATAACC 6060 

TTTAACAGAT ACCATACCTG ATGTTTTTAG GAAAGAGGTA TAAATTAAGT AGATTTGAGG 612 0 

TAAAACAGAG ATAAAGATAA TTCCGTAGAC TGTTGCATAA ATGGCAGCCA TTTTTCCTTT 6180 

TGTAGTTTTT TTAGGCTCAA TTGGATGGAG C AG ATT CAT G CTGAAACTGT AGCGGTTTGC 62 40 

AATGTGTTTT TGGATAAGGA AAATTGCCAA GGCAATGATA ATCGCCATAA TTGCAAAAGC 6300 

AGAATTTCCT CCAACCTCGC TAATAAATTG GGTATAAATC AGGACAGGGA AAGTCCGATA 63 60 

CCCTTCGCCA AT C AAC AT AG GCGTTCCAAA GTCTGAGAAT GCTCTCATAA ATACAAGCAA 642 0 

GGAGCTGCTA GTAAGGTTGG AACTAGGAGA GGTAAAACAA CCGTTACGAT AGGTTTAAAT 648 0 

CCGAAGGACC CCATGCTTTC AGCTGCTTCA AGTAGAGAAT TGTCAATACT GTTCATTGTT 654 0 

CCAGCAACAT ATAGAAATAC CAGTGGGAAT AGTTGCAGTG TAAAGACAAG TACAATTCCT 6 600 

TTGAATCAAT AAATATCGAT AGCTGGAAGA TAAAGGGCAT TTGTCAAAAA TTTAGTGATG 6660 

ACCTCATTTC GTCCTAGCAA GAGAACCCAG GAGTAGGCTC CTACGAAAGG AGCTGACATG 6720 

GAAGCAATGA TAATCAATAT TTGTAGAAAT TTCTTCCCCT TGAAGTCATA CATAGAGAAG 67 8 0 

AGATAAGCTA ATAGGGTTCC TACAACTAAG GAAGTGATAG TAGCGGTAAT GGAAACCTTG 684 0 

AAACTGTTGA CTAGTGTCTC AGAGTAGTAG GCTTTACTAA AGAAAGTGAC AAAATTAGCT 6900 

AGTGAGAATT GTCCTTCATG TATAAGTGCT TGCTTGAGCA CGGTAACGAT AGGATAAACG 6960 

AGAAAGATAG GATAGGTAAG AAAGAGGAAG AAAGAGGAAA CTGTCCAAAT ATTTAGTTTT 7 02 0 

TTACGTTCCA TGGTTGACTC CTTTTATCAG GTTTTGGGAA CCATCTGCAG AAAAGATGTT 7 08 0 

TAATTTTTGC GTATTGATTC GTAGACGAAT ACGATTGCCT TTTTGTAGAT CTTCTTCAAA 714 0 

AGTTGATTCT T C ACT AAC T T GAATTTTTGA GGCAAAACCT GTCTCAATGA AATAATCCGT 72 00 

ATTTAGTCCA AGATAGACGC TATCTCTAAT AGTTCCTTCA AT AT CT C C AG ATTCATCTTT 72 60 

GATAAACTCT TCGGGACGAA TGCTTACATG AATAGCTTGC TCCTCAACCT GATCAAGAGC 7 320 

TGGCATTCGA AGGGCATAGC CATCTGAAAA GACGATATAA GCGCCGTCGC TCCGTTTTTC 7 3 80 

AAGATTGGCA GGGATAATAT TTGTGCGTCC GATAAAGGTT GCCACAAACT CATTAGCTGG 744 0 

TTTATGATAG AGTTCTTTTG GTCGGCCGAT TTGTTGGATC ACCCCATCTT TCATAACAGC 75 00 

AATTTGGTCT GAAATAGCCA TGGCTTCTTC TTGGTCGTGG GT T AC AT AAA CAGTTGTAAT 75 6 0 
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TCCCACTTCG TGTTGGATTT CTCGGATGGC TTGACGCATA TCCAAGCGAA GTTTGGCCTC 7 620 

CAGATTACTA AGTGGCTCGT C CAT G AGG AG AACACTTGGA TTAACCGCTA AGGCGCATGC 7 680 

CAAGGTGACA CGTTGTTGTT GTCCACCACT GAGTTTATCG GGCTTTCGAT CCGCATATTG 774 0 

AGCAATTTGC ATGAGTTCAA GATACTTGTT GGTCTGTTGA ATCAATTCTT CTTTTGGAAC 7 8 00 

CTTCTTTTGC ATAAGACCAA AAGCAACGTT GTCTCGGACA GTCAAATGTG GGAAAATAGC 7 8 60 

GTAGTTTTGG AAAAC CAT C C C GAT AT T G CG TTTGCTGGGT TCCATATTAT TGATTTTTGT 7 92 0 

ATCATCGAAG TAAAATTCTC CACCTTCGAT ACTGTTGAAA CCTGCAATCA TACGAAGAAG 7 980 

GGTCGTTTTC CCACATCCTG AAGCTCCAAG AAGGGTAAAG AG ACT T C C TT TTGGAATTGT 8 04 0 

AATGTTCAAA TTCTCAATAA CAGGGACATC GTGGTAGATT TTTTTGGCGT TAATAATTTT 8100 

GATCTCACTC ATAGTGAACC TCTTTTACTG TTTAGATTGG ATATCTGTAA AGACTTCGTT 8160 

GTATTTCTTA ACGATATCTG ATTTATTCTT GATGACATAA TCATAATCTT CAGTGAGTGT 82 2 0 

TTTGATTTTG TCAATTGGTT TCATGTTTTC GCTTGTTTTA GCATTTTTAC GAACAGGACG 82 80 

GTTAGTAGTG GTTGTACCAA GTGTATCTTG TACTTCTTGA GAGATAATAA AAT CG AT AAA 8 34 0 

TTTCTTGGCA TTTTCCATAT TTTTAGATTT TTTAACGATA GCAGCACTAG CAGGTAGGAA 84 0 0 

GACGGTTCCT TCTTTTGGAT AGACTACCTT AATGTTAGCT CCGTCATTTA AGAGTTTAAC 84 60 

TGCTGGATCT TCATAAGAGA GACCAACAGC CATTTCTCCA TCAGCGACTA C TTT AT AG AC 852 0 

ACTAGATGAA CTTGAACCGA T TTT AC CATC AATAAGTGTG AAAAGATCTT TTACATAAGA 8 580 

CCAAGCCTTA TCATCTTTGT AACCACCTTG AGCTTGTAGC AT AT T TGTT A AT T G AG C AAA 8 64 0 

GGCGCTAGAA GAGTTTGCTG GGTCAGCAGT TGCGATTTTT CCTTTTAGTT CAGGTTTGAA 87 00 

AAGATCGTTA TATCCTTCGA TGTTCATGCC TTTAGTTAAA TCAGGGTTGA CG ATT AAAAC 87 60 

ACT AC CAT CT AGTGTATAAG GAGTAGAGTA GCCAGTTGTG TTTTGATATT CTTTGATAAC 8 82 0 

ATTATCATTT TCTTTTGAAG TATAGTTTTC AAAGAGTTCT CCGTGGGTAG TATATTGTGT 8880 

ATAAGAACCA CCAAAGATAA CAT C AG C T AC AGG AAC T T C T TTTTCTGACT CTAGTTTTTT 8 940 

GAAAAGTTCT CCAGTACCAG CTTGAATCAG TTCTACTTTG ATACCATATT TTTCTTCAAA 9000 

GGCAGGAATA GTTGCTCCAA TTAAGCCCTC TGAGTTTGGT GAATAAACGA CTAGCGAACC 9 0 60 

GCCGTCTCCT TTATCAGATG AACTGTCATC GGCAGATTCA TTAGAAGAAC AAGCAGCATA 912 0 

ATACATCCAT TTCTTTTTCA TGATGGATAC CTCCGTTGTG TTATTTAAGT T TAT TTT AAA 918 0 

ACAATGTAAG CGTTTTTAAA ACATACAATT CTATTCTATA GTGTATTGAA TCTATAACAG 924 0 

T AC AC TTTG A CTGCTAAAAT ATTTCTATAA ATTAATTTGA CTTTCCTGAT AGAGATGTTC 93 00 
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ACATCTTATT TCAATTCACT ATATTAGAGT AAAATTCTCT ACAAAAAGAA GAATAGCCTA 93 60 

TT TT AC T ATT CTTCTGAGTG ATTTCAATTC CTTTGGGGAA AT AT GG AG AT ACTTTTTAAA 942 0 

TCCTGACAAA TGGTTGTTTC TTTTTCTAAA TCGGTGATAC TGTATCGGAG AATGCGCGTG 9480 

AGGTCACAAA GGCTGCGATA GAGCTTCTAT GGAGAATTTC TTTTTGGAGA GATTTTTTAA 954 0 

AGGAATGAGA CATCCGCTAC CTCCTTGGAA GGTTTTTG 957 8 



(2) INFORMATION FOR SEQ ID NO: 12 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 8: 

CGGGCTGTTG TGACGATTCT TATTTCTATC TGTGTTATCT TTTTGGGAAC TATTTTGGGT 60 

GTTGTCTTGG CTTTTGGGCA ACGTTCAAAG TTTAAACCGC TTGTTTGGTT GGCCAACTTG 120 

TACGTTTGGA TTTTCCGTGG GACACCGATG ATGGTTCAAA TTATGATTGC CTTTGCTCTT 18 0 

ATGCATATCA ATGCTCCGAC TATTCAGATT GGAATTTTAG GTGTTGATTT TTCGCGTCTG 240 

ATTCCAGGGA TTTTGATTAT CTCTATGAAT AGTGGTGCTT ATGTTTCGGA GACTGTTCGT 3 00 

GCCGGAATCA ATGCGGTTCC AAAAGGTCAG CTAGAAGCGG CTTATTCGCT AGGGATTCGT 3 60 

CCTAAAAATG CGATGCGTTA TGTGATTTTG CCACAAGCAG TCAAAAATAT CTTGCCAGCA 42 0 

TTGGGGAACG AATTTATCAC CATTATCAAG GACAGCTCCC TCTTATCAGC TATTGGGGTC 4 80 

ATGGAGTTGT GGAATGGGGC TACAACAGTT TCTACAACAA CCTATCTACC TTTAACACCA 54 0 

CTTTTATTTG CAGCATTTTA CTACTTGATT ATGACCTCTA T T CT G AC AG T AGCCTTGAAA 600 

GCTTTTGAAA AACATATGGG ACAAGGAGAT AAGAAATAAT GACAGAAACC TTGATAAAAA 66 0 

TTGAAAATTT ACATAAATCC TTTGGAAAGA ATGAAGTATT GAAGGGCATC AACCTCGAGA 720 

TTAAAAGAGG AGAAGTTGTC GTTATCATCG GTCCTTCAGG GAGCGGGAAA TCTACCTTGC 7 80 

TTCGCTCTAT GAATTTGTTG GAAGAAGCAA CCAAGGGGAA GGTTATCTTT GAGGGAGTCG 84 0 

ATATTACGGA CAAGAAGAAT GACCTGTTTG CCATGCGTGA GAAGATGGGC ATGGTTTTTC 900 

AACAATTCAA TCTCTTTCCT AATATGACTG TGATGGAAAA TATCACCTTG TCCCCTATCA 960 

AGACCAAAGG TGACAGTAAG G C CGTTGC AG AG AAAAG AG C TCAGGAACTT TTGGAAAAAG 102 0 

TTGGTTTGCC AGATAAGGCA GACGCTTATC CACAGAGTTT GTCAGGTGGC CAGCAACAGC 10 80 

GGATTGCCAT CGCGCGTGGG TTGGCTATGG AACCAGATGT TTTGCTCTTT GACGAGCCAA 114 0 
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CTTCAGCCCT AGATCCTGAG ATGGTTGGAG AAGTTCTGGC TGTTATGCAA GATCTAGCCA 12 00 

AGTCAGGAAT GACCATGGTT ATCGTAACAC ATGAGATGGG ATTTGCCCGT GAGGTGGCAG 12 60 

ATCGTGTCAT CTTTATGGCA GACGGTGTGG TTGTTGAAGA CGGAACACCT GAGCAGATTT 1320 

TTGAACAAAC CCAAGGACAA AGGACTAAAG ACTTCTTGAG TAAGGTTTTA TAAGTTAGCT 13 80 

TTGTTTAGCT ATTTGTAGCC AGCTTTAAAC GTTAAAGAGA AGATTAGTGA AAAGCTCAAC 144 0 

CAGAGCTTTT TCTTATAGTT TAAAGCTATA GGATTGCCTA GGAAAGAAGT GTTAGAGCTA 150 0 

CATTGTATTT TTTGGTATAA TTAAAGATAT TTGTAAGAAA AGAGAAGTGA TAT G AC AC AG 15 6 0 

ATTATTGATG GGAAAGCTTT AGCGGCCAAA TTGCAGGGGC AGTTGGCTGA AAAGACTGCA 1620 

AAATTAAAGG AAGAAACAGG TCTAGTGCCT GGTTTGGTAG TGATTTTGGT TGGGGACAAT 1680 

CCAGCCAGCC AAGTCTACGT TCGCAACAAG G AG AGGT C AG CCCTTGCGGC TGGTTTCCGT 17 40 

AGCGAAGTAG TACGGGTTCC AGAGACCATT ACTCAAGAGG AAT TGT TAG A CCTGATTGCT 18 00 

AAATACAATC AGGATCCAGC TTGGCATGGG ATTTTGGTTC AGTTGCCATT ACCAAAACAC 18 6 0 

ATTGATGAAG AGGCGGTTCT ATTGGCTATT GACCCAGAAA AGGATGTGGA TGGTTTCCAT 192 0 

CCTCTAAACA TGGGGCGTCT TTGGTCTGGT CATCCAGTCA TGATTCCTTC G AC AC CGG C A 1980 

GGAATTATGG AAATGTTCCA TGAATATGGG ATTGACTTGG AAGGTAAAAA TGCAGTCGTC 204 0 

ATCGGTCGAT CCAATATTGT CGGAAAACCT ATGGCCCAGC TTCTTTTGGC AAAGAATGCA 2100 

ACAGTAACCT TGACTCACTC ACGT AC TC AT AATCTTTCCA AGGTGGCTGC AAAAGCAGAT 2160 

ATTCTGGTTG TTGCAATCGG TCGTGCCAAG TTTGTGACTG CTGACTTTGT CAAACCAGGT 222 0 

GCGGTAGTCA TTGACGTTGG GATGAACCGC GATGAAAATG GTAAGCTCTG TGGGGATGTT 22 8 0 

GATTATGAGG CGGTTGCCCC ACTTGCTAGC CACATTACGC CAGTCCCTGG AGGTGTCGGT 2 34 0 

CCTATGACCA TTACTATGCT GATGGAGCAA ACCTATCAGG CAGCACTTAG G AC AT TGG AT 2400 

AGAAAATAAG ATAAAAATTT TCTGAGGAAA GTGTATTTTC TATAGCTATA TCTAAAATGA 24 60 

TAGAAATGAA TATTAAATTT TAGAAATAAG TTTATAAAAG GAGGTTTGCG CCTCCTTTTT 2 52 0 

GTTGTATAAT GGAGTGAGGT GAT T AG ATG A TTTTAAAAAT T T AT AATGGG GAATATAGTT 2 5 80 

TACAATGGGA TGGAATATAC TACTTAGCAC TAATTGATTA TCCAAATATT CAAGAGTGGG 2 640 

AATTAGAAAA AAT TGCT AAA TTTATAGCTT ACGAAAAACT TCATAAACGT CAAACAAGTA 2700 

TTGAGTGTGC TGATTCTTGT TTAAAAAAAG AAATTTTAGA TTACATCTGT CAGCATCCCT 2 7 60 

TTCTGCCACC ATT TACT CCT ACAGATAAAA GAGTAGCCTC G AC TT ATG AC C T AC AT AAG A 282 0 

GGTTAGTGAC TTCAGACTAC TGTAGTCATA CTACGACTAT AG AT G C AGCG ATTTCTATTT 2 880 
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TTAAAACTGG TCGTCTTTTA TCTGCTGTGA AAGCCTTTGG GCGAGATGCT GAGGAGTTGG 2 940 

TTTTGGATAG TCGAAATGCT GC AT C T GAT C CG AT AG ATT A TTTTGACTAT GTCATGTTAG 3 000 

GGTGGTCAAA TACAAGTTCT GGTTATCGAT TGGCGATGGA GCGTTTATTA GGTCGAGCTC 3 0 60 

CT T C AG AG AA AG AAT T AC AA GACAAGTTTA TTCCTGGAGT AAGTTTTCAT TTTATCTATA 312 0 

CAGATTTGAT TAAAGTTCCT GGTTATATTT TTGATGGTTA CCATGCTGTA AAAATTAAGG 3180 

ACATGCTTAA TTTATTAAGT GAGTTGTATA TTTGCATTAT TCCAACTCAT AATAAGAGCC 3 240 

AATTTGAAAA TATTATTCCA AC C AAAAT AC AAGATAGGGT GTATTATCTT GACTATGCTG 3 3 00 

GAGAAGACTT AGAAGAGTGG ACTAAGAAAG TCTATCAAGT TGTTTTAAAA CAATCAGATA 3 3 60 

AAGGATAGTT GAGGAAAAAA CGATGAAAGT GATTGATCAA ACCTTACTAG AAAAAGTCAT 3 420 

TATTGAACGT TCTTGTACAA GTCATAAAGG AGACTACGGT CGTCTGCTGT TGCTTGGTGG 3 4 80 

GACTTATCCT TATGGTGGTG CCATCATCAT GGCTGCTTTA GCAGCTGTAA AAAGCGGTGC 3 54 0 

AGGATTGGTA ACCGTTGGAA CGGACAGGGA AAATATCCCT GCTCTACACA GCCATTTGCC 3 600 

TGAGGCTATG GCCTTTTCTC TGCAAGATCA GTAATTGTTA CAAGAGCAAT TGGAGAAGGC 3 660 

AGAAGTTGTC TTGCTGGGGC CTGGTTTACG AGACGATACG TTTGGAGAAA ATCTTGTAAA 3 72 0 

ACAGGTCTTT GCTAGCTTAA AAAAGAATCA GATTTTGATT G T AG ATGG AG GGGCCTTAAC 3 7 80 

CATCCTTGCT AG G AC AAGTT TGTTGTTTCC ATCTAACCAG CTTATCTTAA CTCCCCACCA 3 840 

AAAAGAATGG GAAAAACTGT CTGGTATTGC TATTGAAAAG CAAAACGAAG GTACAACATC 3 900 

TAGTGCCCTG ACTTCTTTCC CTCAAGGAAC AATTTTGGTA GAGAAAGGTC CAGCTACTCG 3 9 60 

TATTTGGCAA GTTGGCCAGT CTGATTATTA CCAGTTAAAG GTTGGCGGTC CCTATCAGGC 402 0 

GACTGGTGGT ATGGGTGATA CACTGGCTGG AATGATTGCA GG AT T T G C AG GCCAATTTCG 4080 

ACAGGCCAGT CTCTACGAAC GTGTGGCAGT AG C AAC C C AT CTTCATTCAG CCATAGCCCA 4140 

AGAACTATCT CAAGAAAATT ATGTGGTCTT GCCGACGGAA ATTAGTAATT GTCTTCCTAA 4 2 00 

AGTAATGAAA AGATATGTCT AAAAT AGTTA GACAAAAAAT GTTGATAATT TGTATCATTA 4 2 60 

TTCTTAATTC ACAAAAAACG AACGTTTAGT ATTCTTCTTG CTAAGAAACT AAATTTGTTC 4 32 0 

GTTTTTTTAC TCTTGTAAAT CTATTTTTGT TAGAGTTGAT TTGGTTTACA TCCGTACTTA 4 3 80 

AATTGATTTG TTAGAGCTCT ACTTTTATTA AAAAAATTCA ATTTCAAGGA TAAATAAGCA 444 0 

GTATTCTAAA GGTACTTTTA GATGAAATAA AAGCCTTTAC ATGGTATAAT AG AGGT AG C T 45 00 

CTTTAATGGA GGTGTTTGAG TGGAAAATCT GAAGAAAATG GCAGGTATCA CGGCTGCTGA 45 60 

ATTTATCAAG GATGGGATGG TTGTAGGGCT AGGAACAGGT TCTACTGCCT ATTATTTTGT 462 0 

CGAAGAAATC GGTCGTCGAA T C AAGGAAG A AGGCTTGCAG ATTACAGCTG TGACGACTTC 4 6 80 
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TAGTGTGACC AGTAAACAGG CTGAAGGGCT CAATATCCCG CTCAAGTCTA TTGACCAAGT 4740 

AGACTTTGTC GATGTGACAG TCGACGGGGC GGATGAAGTG GATAGTCAGT TTAATGGAAT 4 800 

CAAAGGCGGT GGTGGTGCCC TTCTCATGGA AAAGGTGGTC GC AAC AC CAT CAAAAGAATA 4 860 

CATTTGGGTG GTGGATGAAA GCAAGCTGGT CGAAAAACTA GGTGCTTTTA AATTGCCAGT 4920 

AGAAGTGGTT CAGTATGGTG CAGAGCAGGT CTTTCGTCAT TTTGAACGAG CTGGCTACAA 4980 

ACCAAGTTTC CGTGAAAAAG ACGGCCAACG TTTTGTGACC GATATGCAGA ATTTTATCAT 5040 

TGACCTCGCC TTGGATGTCA TTGAAAATCC AATTGCTTTT GGACAAGAAT TGGACCATGT 5100 

CGTTGGTGTT GTGGAGCATG GTTTATTCAA CCAAATGGTG GATAAGGTAA TCGTTGCTGG 5160 

ACGAGATGGA GTTCAGATTT CAACTTCAAA AAAAGGAAAA TAGAAGGGGG CATAAGATGT 522 0 

CTAAATTTAA TCGTATTCAT T T GGTGGT AC TGGATTCTGT AGGAATCGGT GCAGCACCAG 52 80 

ATGCTAATAA CTTTGTCAAT GCAGGGGTTC CAGATGGAGC TTCTGACACA CTGGGACACA 53 40 

TTTCAAAAAC AGTTGGTTTG AATGTCCCAA ACATGGCTAA AATAGGTCTT GGAAATATTC 54 00 

CTCGTGAAAC TCCTCTTAAG AC TGT AG C AG CTGAAAGCAA TCCAACTGGA T AT GC AAC AA 54 6 0 

AATTAGAGGA AGTATCTCTT GGTAAGGATA CTATGACTGG ACACTGGGAA AT C AT GGG AC 552 0 

TCAACATTAC TGAGCCTTTC GATACTTTCT GGAACGGATT CCCAGAAGAA ATCCTGACAA 55 80 

AAATCGAAGA ATTCTCAGGA CGCAAGGTTA TTCGTGAAGC CAACAAACCT TATTCAGGAA 564 0 

CGGCTGTTAT CTATGATTTT GGACCACGTC AGATGGAAAC TGGAGAGTTG ATTATCTATA 57 00 

CTTCAGCTGA CCCTGTTTTG CAGATTGCTG CCCACGAAGA CATTATTCCT TTGGATGAAT 57 60 

TGTACCGTAT CTGTGAATAC GCTCGTTCGA TTACCCTTGA GCGTCCTGCC CTTCTTGGTC 5 82 0 

GCATCATTGC TCGCCCTTAT GTAGGTGAAC CAGGTAACTT CACTCGTACG GCAAACCGTC 5880 

GTGACTTGGC TGTATCTCCA TTTTTCCCAA CTGTTTTGGA TAAATTGAAT GAGGCTGGTA 594 0 

T CG AT ACT T A TGCTGTGGGT AAAATCAACG ATATCTTTAA CGGTGCTGGT ATCAACCATG 6000 

ACATGGGTCA CAACAAGTCA AATAGTCATG GAATTGATAC ACTATTGAAG AC T ATGGG AC 60 60 

TTGCTGAGTT TGAAAAAGGA TTCTCATTCA CAAACCTAGT TGACTTTGAT GCCCTTTACG 612 0 

GCCATCGTCG TAATGCTCAC GGTTACCGTG ATTGCTTGCA TGAGTTTGAT GAACGCTTAC 6180 

CTGAAATTAT CGCAGCTATG AGAGAGAATG ACCTTCTCTT GATTACTGCG GACCATGGAA 62 40 

ATGACCCAAC GTATGCAGGA ACGG AT C AC A CTCGGGAATA TATTCCATTG TTGGCCTATA 63 00 

GCCCTGCCTT TAAAGGAAAT GGTCTCATTC CAGTAGGACA TTTTGCAGAT ATTTCAGCGA 63 60 

CTGTTGCCGA TAACTTTGGT GTGGAAACTG CTATGATTGG GGAAAGTTTC TTAGATAAAT 642 0 
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TGGTATAAGA TGACGCGCTA TGCTTTGCTG GTG AG AG GT A TCAATGTTGG TGGTAAGAAT 64 8 0 

AAGGTCGTCA TGGCGGAGCT TCGTCAAGAA TTGACAAACT TGGGACTGGA AAAGGTTGAG 6540 

AGCTACATCA ATAGTGGCAA TATTTTCTTT ACTTCGATAG ATTCCAAAGC CCAATTGGTT 6600 

GAAAAGCTAG AGACTTTCTT TGCAGTCCAT TATCCATTTA TTCAGAGCTT TTCTTTACTG 6660 

AGTCTAGAGG ACTTTGAGGC GGAACTTGAA AATCTACCAG CTTGGTGGAG CAGAGACTTG 6720 

GCACGAAAAG ATTTTCTCTT TTACACTGAG GGTTTGGATG TGGACCAAGT CATCGCGACA 67 80 

GTTGAAAGTT TAGAGCTGAA AGATGAAGTG CTTTATTTTG GAAAACTTGG GATTTTCTGG 6840 

GGGAAATTTT CTGAAGAATC CTATTCTAAG ACTGCCTATC ATAAGTACTT GCTGAAGGTG 6900 

CCTTTCTACC GCCACATTAC TATTCGTAAT GCTAAAACCT TTGACAAAAT TGGTCAAATG 69 60 

CTAAAAAAAT AATAAAGGAG ACACACAATG ACATTTTTAA ACAAAATCCA TGAAACTGCT 7 02 0 

ACTTTCCTGA AAGAAAAGGG AATTGCAGCC CCTGAGTTCG GTCTAATCCT TGGATCAGGA 7 08 0 

CTTGGAGAAT TGG CAGAAGA AAT CGAAAAT CCAGTTGTAG TAG ACT AT G C TG AG AT T C C A 714 0 

AACTGGGGCC GTTCAACAGT AGTCGGTCAT GCTGGTAAAT TGGTATATGG TGAACTGGCA 720 0 

GGTCGCAAGG TCTTGGCTCT TCAAGGGCGT TTCCATTTCT ATGAAGGGAA TCCTCTGGAA 72 60 

GTGGTGACTT TCCCAGTTCG TGTGATGAAA GTTCTTGGAT GTGAAGGTGT TATTGTAACC 7 32 0 

AATGCAGCTG GCGGTATCGG ATTTGGTCCT GGTACCTTGA TGGCTATCTC AG ACC AT AT C 7 3 80 

AACATGACGG GGCAAAATCC ATTGATGGGT GAAAACTTGG ATGACTTTGG CCCACGTTTC 744 0 

CCAGATATGT CTAGGGCCTA CACACCAGAA TACCGTGCCA CTGCCCATGA AGTGGCTAAA 7 500 

AAACTTAATA TCAAGCTTGA TGAAGGTGTC TATATCGGAG TTACTGGTCC G AC T TAT G AA 7 5 60 

AC AC CAG C AG AAATTCGTTC C T AT AAG AC A CTGGGAGCAG ATGCAGTTGG TATGTCTACG 7 62 0 

GTTCCTGAAG TTATCGTGGC AGCCCACTCT GGCTTGAAAG TTCTGGGAAT TTCATGTATC 7 680 

ACTAACTTTG CGGCCGGTTT C C AAG AAG AA CTCAATCACG AAG AAG T T GT AG AAGTG AC T 7 74 0 

GAACGTGTTA AAGGTGATTT CAAAGGCTTG CTTAAAGCGA TTCTTGCTGA ATTGTAAGAA 7 80 0 

AAAAG AT TT A AAAGGGGGAG TGCCTCTGTT TTTTCAGGAT TGACTGCCTA TCCGGATTAA 7 8 60 

AGAAGAAACA GAGGAATACT ATGAGCTTCT TCCTGCTCTT ATAACTGAAA GAAGCGGAAG 7 920 

AATAGGTATG TCTGATCTGA TAGCCAGCAT TGTGAAAGAC AAG ATT CT AG GATACTAGCA 7 980 

TTAGCTTCCT AGCCAAGCAG ACTAGTATGA TAAGGAGAGA TGAGAATGAA TTGACTTTCT 804 0 

GAATTTCTCA GTCTTATCAT ATATAGCACA ATGAGATTTC GCTTGAGTCT GCTTGTAAAT 8100 

AAACG AAAAG AAAGATAAGA AATAATGAAA ATTGGTCAAC GAATTATGCG CTTTGGCATA 8160 

AAAAATTAAG TATCGGAGTT GTATCTGTTG TAGTCGGCTT TGATTTCTAG CTCCAGCTGG 822 0 
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AATTTCAGCC AATGAAGTAA AGCAAGATGT AACATCTGAA GTGGTAATAG GTGTGCTAGA 82 80 

TTCTAAGGAG GAATTGAAAG AGTCAGAAAA TGATGCTCCA AAACTAGAAA CTCCTCTTAG 8340 

AGAGGAGCCA AG ACT AG CT C CTCAAACGCT TCCGGAAGCA AGTGAAGTTC TTGAAAACAA 84 00 

AAGGGAAGAG TCAAAAGTAG AGATAACATA ACCAGCTCAA GCGGATGATA TCCGCAAGGT 84 60 

TGTTGGGGAA TTAGCCAAGG ATATAAGTAT TACTAAGTTG TAT AT G AC AG GTCATTCTCT 8520 

TGGATGTTAC CTAGCTCAGA TTGCAGCGGT TGAAGCTTAC C AAAAAT AT C CTGATTTTTA 8580 

TAACCATGTA TTGAGGAAAG TGACAACTTT CAGTGCTCCT AAAGTGATTA CTTCCAGAAC 8640 

TGTTTGGAAT GCTAAGAATG GTTTCTGGGA TGTTGGTTTG GAAAGTCGTA AATTAGCTGT 8700 

TAGTGGAAAA ATTAAGCATT ATGTGGTTGA TAATGACAAT GTTGTGACTC CCTTGATTCA 87 60 

TAATAATCGT GATATTGTTA CATTTACAGG TAATTCACGC TTTAAACACC GTTCTCGTGG 882 0 

CTATTTTGAA AGTCCAATGA ATGATATTCC TAACTTTAAT ATTGGTAAAC AAGCTACCTT 8880 

GGATAAACAT GGTTATCGTG ATCCGAAATT GGATAAAGTG CGATTCTTTA AGAAACAGGC 8940 

TCTGCCTCGA TCTTCTAGTC AACCAAGCGC TGAACCAATG GAAAATATTG CCTCAGGAAA 9 000 

ACAGGTTACT CAAAGTTCGA CAGCTTTCGG AGGAGATGCT AGAAGAGCTG TGGATGGCAA 9060 

AGTCGATGGT AACTATGGTC ACAATTCTGT CACTCATACA AACTTCCAAT CTAAGCCTTG 912 0 

GTGGCAAGTA GATTTGGCTA AAGAAGAAAC CATTCGCCAA ATCAATATTT ACAACCGAAC 9180 

AG AC AC T G C C CAGGATAGAT TGGCAAACTT TGATGTCATT CTTTTAGACA GTTCTGGTAA 924 0 

AGAAATTGAG TGAAAACGTA TAACATCTCC TAAAGATGTG TCAGCACAAA TT AC G ATT AA 9300 

CCATAAAAAA GCGCGCTATG TTCGGATTGA GCTAGAAGGC TATAATGCCC TCAGTCTTGC 93 60 

AGAAGTTGAA GTTTTCTGCT TTATAGCTAC GAATGCTGAA ACGG CG AC AC AAGTTTCTAA 9420 

GCCAGTTCAA CCAATCAGTC AGACTCCTGT GAAGGATAAA ACATTGACAA TTCAACACAG 94 80 

TGGAGCTTAC ATTGCCCGCT ACTCCATAAC TTGGGAAGAA GTTCCAGTAG ATAAAGATGG 9540 

AAACCAAGTT GTTCGTAGTC ATTCTTGGGA AGGAAGCGGT CGCAACCAGA CTGCAGGTTT 9600 

TGTCCTCAAC CTCCCAATCA AAGAAAATAT GAG AAATC TG CGAGTTAAGA TTGAGAAAAA 9660 

GACGGGCCTA CTATGGAATA GATGGCAAAC AATCTATGAA AACAGACCAA TTTTAGCTCA 9720 

ACCCCACCGT AAAATTACCC ATTGGGGTAC GACATTGAAT TCCAAGGTGA GTGACGATGA 9780 

TGTCTTGTAA TCTGATGGTA GAATGACAGT TAGTTTGTCT AGTTTATAAG AAAGTACTAC 9 840 

CTGAGCTTGA AT AGGACT C A GGTAGCTCTC TATGAAAGAA CAAAATTAAT ACTCAATGAA 9900 

AATCAAAGAG CAAACTAAGA AACTAGCCGC AGGTTGCTCA AAGCACTGCT TTGAGGTTGT 9 960 
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AGATAAGACT GACGAAGTCA GTCACATATA TAATCCAAGG CGACGTTGAC GTGGTTTGAA 1002 0 

GAGATTTTCG AAGAGTATAA ACAGAAAGGT AGAGCGCGTG TTCTAATTTG AACACGAGTA 10080 

GAAAACTTTT CTAAAAACAA AAACGAAAGG ATGGGTAAAC TGTATTCGCT GAACTGAATA 1014 0 

CGGGCGACTC TCCTCTAAAT CAAAATTAAG AAAGGAATTG ACCCCACCCT AAAAGTAGTG 102 00 

GGAAAAAGAT AGTTGATCTA GCGAGCATCG CTCACTGCGC CCAACTCCTA TTTTCCCTTC 102 60 

GCTTTTTGAT GGGTTTGGTA TCTTTCTCAA TATAAAATAT AAAATAAAGA AAGGT AG AG C 10320 

GTGTGTTTTG ATTTGAACAC GAGCGGAAAA CTCGGAAAAT AGATAATCTG ACTGAAAAAT 10380 

CAGGATTTCT CGTCAGGTTC CTAATTTTCA GTCGTTTTCT TCTCGCTCTT TGTATCATAA 1044 0 

AT T ATGTCT A TCCATATTGC TGCTCAGCAG GGTGAAATTG CTGATAAAAT TCTTCTTCCT 105 00 

GGGGATCCTC TTCGTGCTAA GTTTATTGCG GAGAATTTCC TTGATGATGC TGTTTGTTTT 10560 

AACGAAGTGC GTAACATGTT TGGTTACACT GGTACTTACA AGGGTCACTG TGTATCTGTC 10620 

ATGGGAACTG GGATGGGAAT GCCATCTATT TCGATTTATG CGCGTGAGTT AATCGTAGAC 10680 

TACGGTGTGA AGAAATTGAT TCGTGTGGGA ACTGCAGGTT CTTTGAATGA AGAGGTTCAT 10740 

GTTCGTGAAT TAGTTTTGGC GCAGGCGGCT GCAACCAACT CAAACATCGT TCGTAATGAC 10 800 

TGGCCACAGT ACGATTTTCC ACAAATTGCT AGCTTTGATT TGCTTGATAA AGCCTACCAT 108 60 

ATCGCCAAAA AACTTGGTAT G AC T AC TC AC GTTGGGAACG TTTTGTCATC TGATGTCTTT 10 920 

TACTCAAATT ACTTTGAAAA GAATATCGAG CTTGGTAAAT GGGGAGTCAA GGCTGTGGAA 10980 

ATGGAAGCAG CAGCTCTTTA CTATCTTGCT GCCCAATACC ATGTTGATGC GCTAGCTATC 1104 0 

ATGACCATCT CTGATAGCTT GGTCAATCCA GACGAAGACA CAACTGCAGA AGAACGTCAA 1110 0 

AATACCTTCA CTGATATGAT GAAGGTTGGT TTGGAAACCT TGATTGCAGA ATAATTATAG 11160 

CCAAAAAGGG GCTCTTTGTC AACTGTAGTG GGTTGAAAAA AAGCTAAGCT TGAGAAAGGA 112 2 0 

CAAATTTCGT CCTTTCTTTT TTGATATTCA GGGCGATAAA AATCCGTTTT TTGAAGTTTT 112 80 

CAAAGTTCCG AAAACCAAAG GCATTGCGCT TGATAAGTTT GATGAGATTA TTGGTCGCTT 11340 

CCAGTTTGGC ATTAGAATAG TGTAGTTGAA GGGCGTTGAC GATTTTCTCT TTGTTCTTTA 114 00 

GAAAGGTTTT AAAGACAGTC T G AAAAAG AG GATGAACCTG CTTCAGATTG TCCTCAATGA 114 60 

GTCCGAAAAA TTTCTCAGGG TCTTTGTTCT G AAAGT G AAA AAGTAAGAGT TGATAGATCT 1152 0 

GATAGTGGTG TTTCAAGTCT TCTGAATAGC TTAAAATCTT GTCAAGAATT TCTTTATTTG 115 8 0 

TTAAGTGCAT GCGAAAAGTA GGGCGATAAA AACGTTTATC GCTsArTTTA CGACTATCCT 1164 0 

GT TGGATG AG TTTCCAGTAA CGCTTGATAG CCTTGTATTC ATGAGATTTT CGTTCAAACT 117 00 

GATTCATAAT TTGAACACGA AAACG AC T C A TGGCACGGCT GAGATGTTGG ATAATATGGA 117 60 
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AACGATCTAG AACGATTTTA GCACACGGAA AAAGCTGTTT AGCCAAGTCA TAGTAAGGAC 11820 

TAAACATATC CATCGTAATG ATTTTCACTT GACAACGAAC GGCTCTATCG TAGCGAAGAA 11880 

AGTGATTTCG GATGACAGCT TGTGTTCTGC CTTCAAGAAC AGTGATAATA TTAAGATTAT 11940 

CAAAATCTTG CGCAATGAAA CTCATCTTTC CCTTAGTGAA GGCATACTCA TCCCAAGACA 12 000 

TAATCTTTGG AAGCCGAGAA AAATCATGCT CAAAGTGAAA GTCATTGAGC TTGCGAATGA 12 060 

CAGTTGAAGT TGAAATGGCC AGCTGATGGG CAATATCAGT CATAGAAATT TTTTCAATTA 1212 0 

ACTTTTGAGC AATTTTTTGG TTGATGATAC GAGGGATTTG GTGATTTTTC TTTACCAGGG 12180 

GAGTCTCAGC AACCATCATT TTTGAAsAGT GATAGCACTT GAAACGGCGT TTTCTAAGGA 12 240 

GAATTCTAGA AGGCATACCA GTTGTTTCGA GGTAAGGGAT CTTAGACGGT TTTTGAAAGT 12300 

CATrTTTCTT CATTAGACTT CCACAATCAG GGCAAGATGG AGCCTCATAA TCCAGCTTAG 12 3 60 

CGATAATTTC TTTGTGGGTA TCCATATTGA TGATATCTAG AATCTTGATG TTTGGGTCTT 1242 0 

TAATATCGAG CAGTTTTGTG ATAAAATGTA ATTGTTCCAT ATGATTCTTT CTAATGAGTT 12480 

GTTTTGTCGC TTTTCATTAT AGGTCATATG GGACTTTTTT T CT AC AC AAA AATAGGCTCC 12 54 0 

ATAATATCTA TAGTGGATTT ACCCACTACA AATATTATAG AGCCCAAAAA GGAAGCCCTT 12 600 

TATGAATTGT AGGACTTCCT TTTCTTATCC AGAAATTGAT CTAGCTCTCT CTGATTTCGA 12 660 

AGAATAGTGA CTTTATGTGA ATATTCTTGG CAAAGTTTTT GGTAATTTTC TTTTTGAGTT 12 72 0 

TTGCGGACGC CCATCCCAAA GAATCCATCT GATAAACTCC CACTCAAAGC GTTCAGGGCA 127 80 

ATCTACCGCC ATACTTTCTC TGACTTTTCC ACGGTATTTA AGATAACGCT TAAAGGCTCT 12 840 

AAAGAGACAG GTCAATGGCG AAAAAT TG AG AAAGATGATT TGGTCAGCTT CTTGCATTCG 12 900 

TTCTTGGTAG TAGCACCAAG AATAATTACC ATCGATGACC CAAGCTTTAT GCTTGGTGAG 12 9 60 

AAAGTTTTTT ATCTCGGTTA ACATCCATTC GCAGTCACTG TCTTGCCAAC CAGGTTGAAA 1302 0 

TTGGAGTGTG TCCATGTGCA GTTTTGGAAT GGAGTAGTAG TTAGATAACT TTTCTGCTAT 13 080 

AGTTGACTTA CCAGAACCAG AATATCCGAT AATTGCGATT TTCATTTTCT ACCTTTTCCT 13140 

AT TTGG AGAC AAAAAAACAG CCTCTATGGA CTGTTTCTTA TTTAACAAGT TTAGCTGAAA 132 00 

GACGAGCTTT ATCGCGGCTT GCTTTGTTTT TGTGAATCAA ACCTTTAGTT TCTGCTTTAT 13 2 60 

CGATAGCTGA GCTAGCAGCA CGGAAAAGTT CTTCAGATGG GTTTGCTTCG AAAGCTTTTA 13 32 0 

TAGCAGTACG CATAGCTGAT TTTTGAGCTG AGTTCTTTTC GATTCGTCTA ACGTTCAATT 13 380 

CAGCGCGTTT GATAGCTGAT TTAATGTTTG CCAATGGTCT TACCTCCATA TTTACTAACT 13 44 0 
(2) INFORMATION FOR SEQ ID NO : 12 9: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8512 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 129: 

CCTTTTTTCA AAAACTAGAT ACTAGTCTAT CAAAAGTAGG AAAGGGTTTC AAGAAAATTG 60 

AT TGG AAATT TTTTGAAAAT CATAGAACTA TTAGCTAATC CCTAGTATTG AAAAG AC TGG 12 0 

ATAGCTTCTT TCAGGTCATC TTGTAAACTA TTTCTCTGGT CAAGTTGGAC ATAGACTTCC 180 

ACCAGACAGG ATCTAAAGTT GGAAAATTTG TAAAAATCCT CCCTTTCTTC TATCGGAAAA 24 0 

TCAACAGTTT TTATCCAAGA AGCTACTTGT TCTTGCTCCA ACTTCCCTTG TAAAATAGGT 300 

TCATAGATCA CTCTTGCTAA ACGCCAATCC TCATCATCTG TAAAGCGAAT CGACATTCTT 3 60 

TTAAATAGTT GGCCAAGTAT ATCAAATACT TCATGAACTC TGTTTTTAGG AAAGTCTGGA 4 20 

TGACAAACCA CCTCTGTCAG TAAATCGGCT CCATGTGCAA AAGCGTGAAC CCAACCATAC 4 80 

TG AC TTG AG A AACCCCTTGT ATCCTTTTCT TTTGAAAGAT AGTGCAAGCC TTGATTTAAA 54 0 

AGGACATTAC GAATTTCTGG AG AAGG AT T T CCCAAATGAT CAAACAACCA CTGGATTTCT 600 

TCCTGGTTAT AATTTGGTTT TTCTTCTGCT ATTTTTCTTA GTAAATCTTG AT AC ATGGT C 6 60 

AATACCTCTA CATTTCTAGC AACTGTTCAA AAAGGCAGTC T T AAATG ACT CAATATTGAA 720 

TTCTCAATTA AATACAATCT GATATAAAAT GACGTAAATA ACTATCAATA CCAGTTCTAC 780 

AGTAAGTTCA AATTTAACAT CACGACCTTC AACGACATTT TTGAAAATAG CTACAACTAA 84 0 

GACAAATAGA ATGACGCTTA ACAAGCCCAT AAACATCATT CTAAAAAATT TTTCTATTCC 9 00 

CCTACTCTCC CAACTCAGCA CTATAGGAGA TAATCTGGTC AACTGTGTCA GACAAGAATT 9 60 

GGATGGTATC ACGGAGTGGT TTGTCTGTTG AAATATCAGC ACCGATAATC ATGGCTGACT 102 0 

CAAGTGGTGT CTTGCTACCA CCTGATTTGA GG AG AT TG AG CCAGTCTTCA GCTCCAGTTT 10 80 

CAGAATGTTT TAGATGAAGG TAACCAGCAG TCGAGATAAC TAGTCCTGCT GAGTAAGTGT 1140 

AACTATACAA GCCCATATAG TAGTGAGCTT GGCGCATCCA AGTCAGAGTT GCATCATCGT 12 00 

CAATTTCAAT AGCATCTCCC CAGAAATCCG TCAAAACTTC CTTCATAATG CTGTTGAGCT 12 60 

TGCTTGCTCC AAAGGTCTCC CCTTCTTCAA TCAATGTATA CACCTTACGC TGGAAGGCGG 132 0 

CTTCCAAGAG GTGGGTGATA AAGTTATGGA AGTAGGTGTC TGTCAAGCGA T GAG C C AG AG 13 80 

CGAAGCGTTT TTGACGTGGG TC AT TAG AC T GGTTCTCCAA GTAATCACTG AGTAGCAATT 1440 

CATTGAAGGT TGACGGTGCT TCAACATAGT AGGTCGACAT ATGGGCATTG AAGTAACTTT 1500 
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GATGATTGTC TGAAAAGATG AATTGACCAG AATGCCCGAT TTCATGAATC AAGGTATAGA 15 60 

CATCGCTCAA ACGGCCTGTC CAGCTCATGA GTACATAAGG GTGTACGCGA TATGGGTCCG 162 0 

CCGCATAACC ACCGGAATCC TTGCCACTGT TAGCAGCAAA GTCCACCCAG CGCTCTTCTT 1680 

GGTAACGAGC AACTTCCTGA CAATATTCTT GCCCCAAAGG TTCTACCGAC TTCATGACCA 1740 

AATCATAGGC ATCGTCAATA GTCACTTCAG GATTCAGGGC GCTGTCCAAG TCCAATTTCC 1800 

AGTCTGCAAA GGTCATCTTT T C AAGAC CAT TTACCTTGGC AACATGCTTG AGGTATCTCT 18 60 

GAGCGACTGG TGCAAAGTCC TTCATGATGA GGTCAATCTG GCGGTCAAAC ATGACACGGT 192 0 

CCACTTCTTG TTCAGCTAGA AGATAGTCAA AG AC AG AGT C GTATCCCTTC AT AT C AG CCA 1980 

AGAGTTTTTC AG AC T TG AC C TGAGCCAGAT AGGCTGCTGC AGCCGTATTT TGGTGCTTAC 204 0 

GAAGTCCCTC TGAGAAGGAA CGGAAGGATT TCTCACGAAC CTCAGCATCC TCATGGTTTT 2100 

GGTAGAAATT CTCATAGGTC ACAAAGCTGT TTTTGTAGGT CTTGCCATGG GCTTCAAAGT 2160 

CAGCCATTTC AAAATCC C C A GCTCGCATCT TAGTATAAAT GTCCTGCGGA CTGTAGAAAA 222 0 

CTTCACCGAG ATTTGTCAAG GCCTTCTCCA CATCTGCCCC TAAGTAGTGG GCTTTTTTGA 22 8 0 

TTTTAGCCTG ACGAATGGCA GCTGTTAAAT GTGGCAATTT ACCCAAACGG TCCAAGACTT 2340 

CCTCATCTGC TGCCACCAAG GCATCGTCAA AGAAGGTCAA GGCTACGCTG GCATCTGTTT 2400 

CAAATTCCAT CCCAGCTTGG GCAATATTGG CAAATTCGTC ATTGCTATAG TCCGTCGTCT 24 60 

GAGGCATAAA ACCATAGTTG CCAATATGGC TCATCTGAAT GTAGATCTGT TCCAATTCCG 2520 

CAAAGGCCTT CTCGAAATCC TCAAAAGTGT GAAGATTGCC CTTGTAATCA CGGCTAAACT 2 580 

GGTTGATGTC TTCGCGAGCT TTCTCGATTG CACGCAAGAA ATCCTCACGG TCTTGGTATA 2 640 

GGGCTGTTAA GTCCCAGAGT TCCTTCTCTG GAAATTCTGA ACGGTGTTTT TGTTCCATTT 2 700 

TCTTCCTCTT ATTTCTCTAA TTC TACT AAA ACACTAAGGG CTGATAAAGC GTAAAGCGGT 27 60 

GCTGTTTCTG CTCGCAAAAT ACGAGGACCT AGGCCTGCCA AAACGGCTCC TTTAGCTTCA 282 0 

AAACTT TCG A TTTCTGCAGG TGAGAGACCG CCTTCTGGAC CAAAGATAAA GAGCAGTTTG 28 8 0 

GCTCCTGTTT CAAGACCAGT GACTGCTTGC AG AAG CGC AG CGGCTTCTCC TTCTTTAGCT 2 940 

GATTCTTCAT AGGCTACTAT GATAGAGTCA AACTGGTCCA GCTGAGCTAG AAAATCTGCT 3000 

TTTTTCTCGA AAAGTTTAAT ACTTGGTACA ATATTACGCT TGCTTTGCTC GGCTGCTCCA 30 60 

AGGGCAATTT TTTCTAGTTT TTCAACTTTT TTACCCAATT TCTTGCCATC C C ACT TGGC A 3120 

ACTGACCAGT CTGCAGGAAA GGCCCAGATT TGGCTAGCCC CCAGTTCGGT TACTTTTTGA 318 0 

GCGATGAACT CCAGCTTGTC TCCCTTGGGA AATCCAGATG CGATGGTCAC TTGGACTGGT 32 40 
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AGTTCCACAT TGTCATTTAA TTCTTGGACC AACTCAAACT GACGATTTTC CATATCCAGC 3 300 

ACGCGCGCCA AGCGCTTGAT GCCATCATCA AAGACTAAGG TAACCTCATC CTCTTCTTTC 3 3 60 

AAGCGCATAA CCTGAAACAT ATGCTTACTG GTTTCCTTGT CCTCGATAGT GACAGGAGAG 342 0 

ATAGCACTGC CTTTTACAAA ATACTGCTGC ATGCTAGCCT CCAATCACAC CAGAGATATC 348 0 

CTTGGTTTTC TTAAAGACAC AGGTATTCCA TTCCCCTTGA ACCATGTGAG TTTCGAGGAA 3 540 

AAATCCAGCT GACTCAGCCG ACTGGCGCAC CATGTCCAAC TTGTCCTTGA TAATGCCACT 3 600 

CATGATCAGG TAGCCTTCAT CCTTTACCAA GCGATAAGCA TCGTCTATTA GATGAATGAG 3 6 60 

GATATCCGCC AAGATATTAG CCACAATCAC ATCTGCCTCA ATTTCCACAC CCTTAAGCAA 3 720 

ATCTCCAGCC GCTACATGGA TATTTTCCAT GCCAGGGTTG AGCTCAATAT TTTCCTGAGC 37 8 0 

CACACGAACC GCCACATCAT CCAGGTCATA GGCGAAAATT TCTTTAGCCC CCAGAAGCGA 3 840 

GCTGGCAATA GAGAGAACCC CTGAACCAGT CCCCACATCT AGCACCGTTT CGCCACCACG 3 900 

AAGAACCTGT TCCAAGGCAA AAAGGCTCAT CTTGGTAGTT GGGTGGGTTC CAGTACCAAA 3 960 

AGCCATGCCA GGATCCAGCT TG AT AAT CAT TTCCCCCGCA GTCGCCTCAT AGTCTGTCCA 402 0 

AGAGGGAACG ATGGTCAAAT CATGAGTGAT ACGAGCAGGT TCATAGTATT TCTTCCAGTT 40 80 

GTCTGCCCAG TCTTCCTCAG CCAAGGCAGT CGTACCTATT TTTAACTCTC CCAAATCCAT 4140 

AAAATCTGTC AATTCTGCTA GACGAGCCTG CAAATCCGCC TCAACCACTG TCACATCCAC 4200 

CGTGTCAGGG TAGTAGGCTG TCACTACGAT TTCTTCTTGC TGCTCCACCT CTGGGAAAAT 42 60 

CTCTCCAAAG CGGTCCACAT TT C C C AC AT A GTCCATACTG TCTTCGATTG CGACTCCTTG 4 32 0 

CGCTCCCAGC TCAATCAAGA GATTGGAAAC CAACTCCTCT CCCTCACGCT TCACTGTAAC 43 80 

TTTTAACTCT TGCCATGTTT CCATTATTAA TACCAAGCCC GTAAAACACA AAACCAAAAT 444 0 

AGGAAAT T C T CTGAAGACGC TTGTGTCTAA GAGAAGTTTA TCTTTTTGGC ACAGTGTTTA 4500 

GGGCGGGTTC AGTTTAGAAA TGTAACTGAA CCATCCTTTC TAATCACTTA CTTTTAAATA 456 0 

ATCTTTTAAT CTCTCTTGCA ACTGAGGCAC AACTTGACTG GAACTAAGAA ATTCCTCAAC 4 62 0 

ATTCATCAGC TGATAGCCCT GTCCTTCATC TCCGAAGATG ATATTGTCAA ATTGTTCTTG 4 680 

TCTTAGCTGA CCAACCATAA AGACCGATTT CTTGCCTTTA AAAATTACGC TAGGATAAAT 4740 

CTTGCTCCAA AGCAGACAGT CTTCATCTAA ATGAATTCCC AGTTCCTCAT AAACTTCACG 4 800 

CCGAGCGCAT TCAAAAGGGC TTTCGTCCCC TTCACGGCCA CCACCTGGCA GT T C C C AC AT 4 8 60 

ATTGGCCCAG GG AAT AC T T G CCTTATCATC GCGTAAGATA GTCAAAAGCT TATCCCCACA 4 920 

AAACAAAGCA AT CTTGC AAC CTGTGAAATC AGAAATTTCT AGTTCCATCT TCAGTTCCTT 4 980 

CTAACATTTC CTTTTCCAGC TCGGCTAACC AGTTTTCATA ATATCTTTTC TCATCCCTCA 5040 
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AC ATT CG ACT ACTATCCATT TTCTGTCTAG CAATCTTGAG AGCCTTACGA GTTCGATCTA 5100 

CATCTTTCTT CACCTTTAAT TGATACCAGG CTTGTATCAC TTGAAGATTG GACAGTTTGA 5160 

GAGACAGAAA CGATTTGACC TGTCGAATAC TAGCATATTG CTCCGCTTGC TCAAAATCTC 52 2 0 

CTTCCAACAA GGCGATATGA AGCAGGGATA GTTGGGCAAC TGTCTGCATC ATCGGAGTAG 5280 

TTGTCCTCTC AAGTAATGCT TGAAACTGCT GTTTAGCTAC TTCTTCCTTC CCTTCCAAAA 53 40 

TGGAAACTTC ACCTTGCATA CCTAATACAC CATCCGCAAA ACTCCCTCGT GCATCCTCAG 5400 

GAACTGCTTG AACAAAGTCT TTCAAATCAT ATTCTTGAGG AGCTAGCAAG GTCTGGGCAG 5460 

AATGTCTCAA TACCAGGTAG GCGTATTTGG TATTTTCAGG GTGTTGTAGT AATTCCCAAA 552 0 

TTTTTGCTCC ATCGGTGATG TCGACTGGCA AAATGTTATT TAGGAAGAAA GATAAATTAA 5 58 0 

GAAAAATCCA AGTCCCTGCA AAATACCAGC TTCTTGTCAA AAATCCAAAC AATATCGCCA 564 0 

ATAATATCAA GCCGAGATGA ACCATCAAGC CTCCTGAAAG CATCAGGATG ATTCTTTGAT 57 00 

CGCTTTCATC CTCTTTTAAA CCAATGTATT GAGCACCAAC ATTTTTCAGA ATGGCTGTTC 57 60 

TACTAAGATG AAACCTGCCT GACTTTTTGG TCAAAATAAA ATGTCCTAAT CCAAAAGCCA 5 82 0 

CCAGCCGATA GCCTGTCAAG TAGCCACAAA AAGCATGACC CAGCTCATGA AGAATAAAGA 5880 

TTAAATACAT GC T TAG AAG A GCGAAGGCAT AACCAAAAGT AAAGGCTAAA ACTGCGGAAT 5 940 

ACCCCAACTC TGCAAATGCG ATTGTTCCAC AAG C AAAAGC TAGCATAATA AAGACAACAG 6000 

CTAGCACATA AACCAAATAA GTCCCAATTT TCTTCATAAC ACCTCCAACC AACTCCTAGT 60 60 

ATCTTGGATA AGGATAAAAT TCTCCCTTTT CCAAGCCAAT TTTTCCTTCT TCAAAGACTT 612 0 

CTTGGTTCCA TTCCATGACA AATTCCTCTG CTTCTGGGTC TTCCAAAAAG TCCATGAGGA 6180 

CATCTAGCCC AACCTCAGCA GTATCTTTAA GGAAAAGCGC AAAATAAGCT AAAAATTCAC 6240 

GGGAAAATCC TTTTTTAGGC AGGTAAGGAA TAACAGTCAA ATAGTCTTCC TCATTGACTG 63 00 

TTGACTTGGC AGGATTGTAG AAAAGGACCG CTTCCTCAAA AAGAATGTCA TCTGATGAAA 63 60 

CCTCTCCGTC TTCATCCACC ATCTCCACAC CGCAGCATTT TGCGCTTCCA ATAGAAAACT 6420 

CACTTCTACC GCATGGTTGC GTTTGTCCCA GCTAATCTCA AAGTCAAAGG GAAAGTTCTT 64 80 

GTCCAACTCT TCCTCTAAAA TATCTAAAAA TCCGTATGTT GCCATTTTGT CCTCTTTCTA 654 0 

TGCGACTCTT TAATCGCCCC GATTGCTCGG AAATATGCTA AAATAGATAC TACCATCTTA 6600 

CCACAAAATT ATTTTATGTC CTAATTATAC CATATTACCT CATTTAAACC CTTGGTATCA 6660 

GTGATTTTCT TAAAAGTCTG ATTTCTTCAT TTCTCATAAA AATCAATATA AAAAGCCCTC 672 0 

GAAAGGGCTA ATAAATCTAT AAAATCAATA GGCGAGTAAC TAGCACAAGT GGACGTGCTT 67 80 
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TTTTATTGAC TATTACCACG ATACCACGCT TAATCTTAGG CTTGAACTTT CTTATCTGCA 684 0 

ATAGCGTCTG TCAAAGTCTG AGAAAAGTTA AGCCCCATTT CTCGTCCCAA CTTATCTGCC 6900 

CATTTTGGTA TGGTCAAAGT CTTTTTAATG GGTTCCTGAC TTCCTAGGTA TTCTGATACA 69 60 

TCAACAGATA C C AT AG AAAT AAAAGATTTA TCAAGGTCAT AGGTTGACAC GAAATCTTCA 7 020 

TCATCTTTAA AAGG AT C AT T ATCAATTAAA GACAAGCTAT TG AT AT C TG A TGGCTGAGGT 70 80 

AACTCTCCAT CACTCTCTAT CAAATCTGCA ACAGTTATCC CTAGCCACTC CGACCCCATA 7140 

GCCAAAGCCT C AG AAAT C C C CTCTCCTTGT GTAGCTGAGT ATTCAAAATC TGGGAAATGG 72 00 

ACAAAATAAG TCGCTTCTGT TCCGTCTGTG TCGTCATAAT AAAATAAAGC TGGATACGTA 72 60 

ACTAACATTT CACTACCTCC ATATCAAAAA GCAGGGACTG AATTTTACAA CCCAGCTTGC 7 32 0 

TTTCTTATCC CTCTTTCAGT GTACTTATTC AGCTCACCAT GAAGGATTGT GATAGGTCTT 7 3 80 

TCCCCTTGCT TTTCCATTTT AATATGGGAG CCTTTACCGC CTCTAGTCTT TATCCAACCA 7440 

TGGGCCGTAA GGAGTTTAAC CATCTCTTTT TGTGTCATAG GCATAGCGCT TTTACCTCCT 7500 

G AC AAC AC C A TTATAACACG TGTTACACGT AT TGT AAAGG AGTGATACTT ATTATTCTAT 75 60 

TATACATAAA AGCCCCTAGA TGTGGTT C T A AGGGAAGCCA ATTTATTCAT ACCTATTTTT 7 62 0 

CTAATGAGTA GTAAAAACTG CTTCTTTATC GAGCAATTCA TCATCTGTAT AGTCAATTGT 7 680 

AAAAGTATCT CGATCTAAGA CAGATTGAGG CGGAGTTGAA TGAATCATAG G AAC AC TG CG 774 0 

TACTCTATAT TTTTTATCTC CAATTTTTAC AAACTGATAC TCTTCGAAAA TCAAATTCAA 7 800 

ACCACGTCAA CGTCGCCTTA CCGTACTCAA GTACAGcCTG CGGCTAGTTT CCTAGTTTGC 7 8 60 

TCTTTGATTT TCAT TGAGT A TGATTAACTC TCAAGTCTTC G AAAT C AGG A TTTTCAACAG 7 92 0 

TTATTACAAG GAGGCG ATT T ACTACTTCAA AAACATCAAT TATTCTATTT TTCATATTTT 7 9 80 

TT C AAC C CAT TATTAGAATG AACTTCTTGG TAAGCAAAAT CAAGTTTAGA TTTAATGTTT 8040 

TCGTACAAAT CTAAAATCTC TTTTGGAGTA TCTTCCCGGA AGAAAAGTTT TCTTTTCCCT 8100 

G AAAT AAC T T GAT C ACT AAG AATCCAATGA CGAATTTGTT TTGTAAAAAT CAAAATTTCC 8160 

TGACTTGGTA GTTCCATCAT TTCCATTGCT TATCACCTCT CTTTTCATTA TAGTTCATAC 8220 

AATGACATTC AGCAATATTA TTTCTCAAGT CAGCACTTCC ACTTCTTTAG GCTCAACTAT 82 80 

CCTATTTTGA GCTTTAAGGA AAATCAAATC TCTCATGCTG ATACCTCTCC TCATTAAATT 8340 

AAATAGTAAA AAAGATTCTA TCTCACTCCC TGATTATTAC AAAACCATTG AAAT AT C AC A 8400 

ACTAATAGGC TAGAATGGAC ATAGTAAGAT ATAGTAGATG AGTCATTCTA CTCAAATCCA 84 60 

CGTTAGAAAG GACTGCTATG CCAGACAATC TCGCCGTTCG CATGCGCCCn GG 8512 
(2) INFORMATION FOR SEQ ID NO : 130: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 0: 

CTCGTTTCAA GGTTGAGTCT CTTGCAAATC TTGTTCGCGT TCTTCCTTTT GCCAAGGCAT 60 

CTCTCCCATG GTTGGTGCcA GCCATTGTTG GAATCTTGCT CTCATTGGTT CTACCAAACA 12 0 

AGCAAGAAAG CGATGTTTTT GAAATGGAAT AATCACTTAA ATCACTTTTG TAGCCAAGTC 180 

TACAGGAGTG ATTkTCTTTT TTTATCCGAT GATAAATGTG TTATAATAGG TAGCGAAAGA 240 

GGTGAAGAAA TGAATCAAAC AGTAGAATAT ATCAAAGAAC TG AC AG CC AT TGCGtCGCCA 300 

ACAGGCTTTA CTCGTGAGAT TGCGGACTAT TTAGTCAAGA CTCTAGAAGG TTTTGGTTAC 3 60 

CAGCCGGTTC GCACATCCAA GGGCGGTGTC AATGTAACTA TTAAAGGTCA AAATGATGAG 42 0 

CAACATCGCT ATGTGACTGC CCATGTAGAT ACGCTTGGTG CTATTGTCCG TGCTGTCAAA 4 80 

CCAGACGGCC GTCTCAAAAT GGACCGTATC GGTGGCTTTC CTTGGAACAT GATTGAAGGA 540 

GAAAACTGTA CCATTCATGT GGCTAGCACA GGTGAAAAAG TATCAGGAAC CATCCTCATC 600 

CACCAAACTT CTTGCCATGT CTATAAGGAT GCAGGAACTG C AG AACG C AC GCAAGACAAT 660 

ATGGAAGTGC GTTTGGACGC CAAAGTAACT AGTGAAAAAG AAACTCGTGC TCTTGGCATT 72 0 

GAGGTCGGTG ATTTTATCAG TTTTGACCCA CGAACTGTCG TGACAGAGAC AGGTTTTATC 780 

AAGTCTCGCC ATTTGGATGA CAAGGTCAGT GCGGCG ATT T TGCTCAATCT CCTTCGCATT 840 

TATAAGGAAG AGAAGATTGA ATTGCCCGTA ACAACTCATT TTGCTTTTTC AGTCTTTGAA 900 

GAAGT GGG AC ACGGTGCAAA CTCTAACATT CCTGCTCAGG TAGTAGAATA TCTGGCTGTG 9 60 

GATATGGGAG CCATGGGAGA TGACCAGCAA ACAGACGAAT AT AC AGTGT C TATCTGTGTC 1020 

AAGGATGCTT CTGGACCTTA TCACTATGAC TTCCGTCAAC ATTTGGTGGC TTTGGCGAAA 1080 

GAGCAAGATA TTCCATTTAA GCTGGATATC TATCCATTTT ATGGTTCGGA CGCTTCAGCG 1140 

GCTATGTCTG CAGGGGCAGA AGTCAAACAC GCCCTTCTCG GTGCTGGTAT AGAGTCTAGC 12 00 

CATTCCTATG AGCGTACCCA TATTGACTCG GTGATCGCAA CAGAACGAAT GGTCGATGCT 12 60 

TATCTTAAGA GCACGTTGGT GGACTAATAT GTGCCTTATT TGTCAGAGAA TTGACCTCAT 1320 

CAAGAAGGAA GAAAATCCTT ACTTTGTCAA AG AGT TGGAA ACAGGCTATC TTGTGGTTGG 13 80 

AG AC C AC C AG TATTTTGAAG GCTATAGTCT CTTTCTAGCC AAGGAGCATG TCAGCGAATT 144 0 
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GCACCATTTG AAAAAGGAGA CAAGACTCCG TTTTCTAGAA GAAATGAGTT TAGTCCAAGA 1500 

GGCAGTTGCC AAGGCCTTTG CTGCTGAGAA AATGAATATC GAACTGCTAG GAAATGGCGA 1560 

TGCTCATCTT CATTGGCATC TGTTTCCACG ACGGACAGGT GATATGAATG GTCATGGTCT 162 0 

CAAGGGTCGT GGACCAGTCT GGTGGGTTCC CTTTGAAGAA ATGACAGCAG AAACCTGCCA 1680 

AGCAAAACCG GATGAGATTA AAAGATTAGT CAAACGTTTA TCGTCAGAAG TAGATAAACT 174 0 

ATTAGAAATA AAGGAGTAGA AATGAAGAAA AGATACCTAG TCTTGACAGC TTTGCTAGCC 1800 

TTGAGTCTAG CAGCTTGTTC ACAAGAAAAA ACAAAAAATG AAGATGGAGA AACTAAGACA 18 60 

GAACAGACAG CCAAAGCTGA TGGAACAGTC GGTAGTAAGT CTCAAGGAGC TGCCCAGAAG 1920 

AAAGCAGAAG TGGTCAATAA AGGTG AT T AC T AC AG C ATT C AAGGGAAATA CGATGAAATC 19 80 

ATCGTAGCCA ACAAACACTA TCCATTGTCT AAAGACTATA ATCCAGGGGA AAAT CCAAC A 2040 

GCCAAGGCAG AGTTGGTCAA ACTCATCAAA GCGATGCAAG AGGCAGGTTT CCCTATTAGT 2100 

GATCATTACA GTGGTTTTAG AAGTTATGAA ACTCAGACCA AGCTCTATCA AGATTATGTC 2160 

AACCAAGATG GAAAGGCAGC AGCTGACCGT TACTCTGCCC GTCCTGGCTA TAGCGAACAC 222 0 

C AG AC AGGC T TGGCCTTTGA TGTGATTGGG ACTGATGGTG ATTTGGTGAC AGAAGAAAAA 22 80 

GCAGCCCAAT GGCTCTTGGA TCATGCAGCT GATTATGGCT TTGTTGTCCG TTATCTCAAA 2340 

GGCAAGGAAA AGGAAACAGG CTATATGGCT GAAGAATGGC ACCTGCGTTA TGTAGGAAAA 2400 

GAAGCTAAAG AAATTGCTGC AAGTGGTCTC AGTTTGGAAG AATACTATGG CTTTGAAGGC 24 60 

GGAGACTACG TCGATTAATA CTCTTCGAAA ATCTCTTCAA ACCACGTCAG CGTCGCCTTA 252 0 

CCTACTGACT GCGTCGGTTC TATTCACAAC CTCAAAACAG TGTTTTGAGT cGATTCGTCA 2 580 

GTTTTATCTG CAACCTCAAA GCTGTACTTT GAGCAstGCG GCTAGCTTCC TAGTTTGCTC 264 0 

TTTGATTTTC ATTGAGTACA AAAAGTAAAC TTTTCTCTTG CAATTCCAGA TAAATAGTGT 2 700 

ATAATGGATG GGTATGTGAA AAACATACTT GTGGGAGGTA AAAAT CTCT A ATTACCGCCA 2 7 60 

AAACCACAAA GGAGGATTTA AAAATGGCTA AAAAAGTCGA AAAACTTGTA AAATTGCAAA 282 0 

TCCCTGCTGG TAAAGCTACA CCAGCTCCAC CGGTTGGACC TGCTCTTGG 2 8 69 
(2) INFORMATION FOR SEQ ID NO: 131: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
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CTGAATCCCT TATAGGAGTC CAGTAACTTT TTAGCCTCTA CTTTGCCTTC AT AGGC AG CT 60 

TCAACATCAT TAAAAAAAGA ArGCACTGAA GCAAGTTCTT C AGTGCT CCA CGACAAATCT 12 0 

AGTGGGTAAC TATACTGTTT GTTCATTAAC TAATACCAGC TCTCATTCTT GCTTCTTTTA 180 

GTTCTTGCTT ACGATAACTA CGAGGGAGAA AAGCACGAAT CTCATCTTCA TTAAAACCGA 240 

TTTGCATACG CTTGGCATCA ATAATAATTG GACGACGCAA AAGACTAGGA TACTGCTCAA 300 

TCAAATGAAG CAATTCCGAT ACCGAAATAC TCTCTACATC AATATTCAAT TTTTGAAAAA 3 60 

TTTTTGAACG AGTTGAAATG ATGTCATCAG TACCATTTTC GGTCAAGGAA AGGATGTGTT 42 0 

GCAATTCTTT TCTTGTTAAA GGACTGGTCA TAATATTGTG TTCCACAAAG GGAACTTATG 480 

TTTTTCTAAC CAGGCCTTAG CCTTACGACA T G ATGT AC AG CTCGGTGATA GAAATAGTGT 540 

AATCATGCTT TTCTCTTCTT ATCTATACTT TGCTACTTCT ATTATACAAA AAAATAAAGC 600 

GCTTGACTAG GGATTTTTAG AAAAAAAGCC TATTTTTTCA AGAAAAATAG GCTTTTTGCG 6 60 

AACGATTGAC AC AATT GG AT TTGGTTAATT CACTCTTAAC GATGGTTTTA AACGATATAT 72 0 

ATT T T TAT AT ATGTAAATTA AAAACATCTT TCCTTTCACT TCCTACGACT TTTCAGATAC 780 

AGATAGCCAA AGAAGTTTTC ATAGAGGGCA AAAAAGAGGA GGAAGGCATG AAGAAAGAAG 84 0 

GTCTCTGGCA AAATCATAAT AACAGGATCC TTGGCTGGAT CAAAAAGCCA GGTATCATCT 900 

CCCACAAAGA G AATTTG AT G GAAAAGAGTA AAGAATTGGT CAAAACCAAT CAAAACTCCC 960 

CCAAGTCCAA T CATC AC AG G T AAG ACT AC T AGAGCCAGGA GACTTTTTCG ATAAAGAGAC 102 0 

AAAAAGTCCT TTTTCACAAT CCTATTGACA AAGACATAGA AACTTGGCAG TGT C ACT AG A 10 8 0 

GCTACTAGCT GAACCAAATG AAAG AG AT T C TTGACCACTG CGAAATGGTG C AG AC C AG C T 1140 

GCTGACGAAC GAAAATCAGG CATC TGT AAG ACCTGACTAA AAGGATTGGT C AG AT AATT C 12 00 

AT C AAGAT AT GAAAATTGTA TTGAATGGTT TCTGGTTTTA G AT AG AC TCG ATTCGTTAAG 1260 

TTTAGCCACT GAATCTCCAT AGGATAGAAA ATCCAAGCCA GATAAATGGT CAGAAGGATG 132 0 

GAGAGGGAGA GGAGAAAGAG CAT AG AG C C C C AAAAG AT C A ATT TAG T T T T CAT C AAAATC 13 80 

CCACTCCGCA AGGCTAGAAA CCACATGTGT CGGTGCGATT GGCAGGCCAG CTACTTCTTC 1440 

TGCCTTAGTA AAACCTGTCG TCACCAAGAG CGTTGGAATG CCATTGTCAA TCCCAGCCCG 1500 

AATATCAGTC AAATAATTGT CCCCAACCAT GATTAACTCT TCACGTTCCA AACCTAAGTG 15 6 0 

CTCAACCGCC TTGTCCATAA TGATGGCATT TGGTTTTCCG ATATAAACCG GCTTCACTCG 162 0 

TGTCGCTACT TCAAGCAGCG TAATCAGTGA GCCAGCACCT GGCAAAAGAC CGCGTTCCGT 1680 

CGGGATGTTG AGGTCAGGAT TGGTTCCGAT AAAATGGGCA CCCTTTTGAA T AG C AAG AG T 174 0 
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TGCTGTGGCA AATTTTTCAT AGTCGACTTG CCAATCCAGA CCAACTACCA CGTAGGCAGG 1800 

TTTTTCCTTG TCTTCCACAT AACCAGCCGC CTTGATGGCT TCCTTGAGTC CTGCTTCTCC 1860 

GACGACATAG ACGGTCTTTT CAAGCCCCAA AT C ATTC AT A TAGTCGATGG TTGCCAAAGT 192 0 

CGCTGTGTAG ACAGTCGATA GGGGCGTATC GATATTAAAA TTCTGAGCCA ACATCTCCTT 1980 

AACACTCTCT GGAGTGCGGG TTGTATTGTT GGTTACAAAG AGATAGGGAA TGTCCCGCTT 2040 

TTGCAATTCA TGAACAAAAG TCTCTCCAGC AGGGATTCGG TCTTTCCCCT TATAAATGGT 2100 

TCCGTCTAAA TCAATTAAAT AGCCTTTATA TTTCATCTAT TTCTCCCTAA GCCTTTTTTA 2160 

TTTCTTGCCA AGTAATGATT GCTTGGGCAT TGATAACCCC ATCACTTGTA ATTTCATGCT 2 22 0 

TGCTTTCCAG TCCAGTCCGT TCAACAGCCG ATGTAATCAC CCCACCTGGT CGAACTTCCT 2280 

TGACATACTT GAGGTTGATT TTCTTGGGAA TATAGTGGGT CAAAAAATCC GCT C C CAT G A 2 3 40 

CCTCAAAAAT CCAGTCCAAG TATTTACTGT TATTGACATG ACCATTCATA TCCAAGTCGT 2400 

AAAAACGAAC ATGGTAATCC TTGCTGATCG GTTCTTCCAA GG ACT CAT AC TTCGGTCCAC 24 60 

GGATAAGTTT TTTATCAAAA TCAGACTGGT AAGGAGCCAC AATCTCAGGT TCAACAACAT 252 0 

GGACTTTTCG ACTGTCGCGG TCCATGAGAA CAAAGGTCGC CATCATGTGG ATGAGCTCCT 2 580 

GCTCCGCTTC ATTATAAATA GTAAAGCGAC GGTAGCAAAA AAGTCGATTG TAGCTCAAGG 264 0 

CTTCCGTTTC GATGGTAATT TCTTCCGCAA AACGAGGCAA ACGAACCACC TCAATATCAT 2700 

ATTCTACGAT AATCCAGACC AGATTATATT CTTCCAAAAT GGCCTTATCA CTAACTCCCA 2 7 60 

GTTCAATCGA CTGCATCCCT GAAACTTGCA GTGACAGCAA AATCACATCT GGAAGTTTGA 2 82 0 

TATGACCGTT CATATCAGCC AT AT C AAAAG GAATTTTCAT TTTCATTTGA TAAGTTAAGC 28 80 

CCATGATCCT ACTCCAAAAT AAATCGTTCT GCTACAGTAT CTCCCAAAAA GAGACCTCTC 2940 

TTTGTCATGC GAACGTGGTC ACCCTCAATC TGCATGAGGC CTTGTTGAAC CAAATCTCTG 3000 

ACAATTTCTC CATAAAGTCC AG C AAAAG AC TGTCCAAATT TTTCCTCAAA TCGCGCCATG 3060 

GAAACCCCGG ATTTCTTGCG GAGTCCCAAG AACATTTCTT CTTCCATTTG CTCCTTTTGA 312 0 

CTCAGGTGAT CTTCTGTAAT ACAAGCATTG CCTTCCTCAA CCGCACTGAG ATAATGACGA 3180 

ATGGGACCAT GATTTTTATA GCGTACTCCA TTGACATAAC CAGATGCCCC TGCACCAATA 324 0 

CCATAGTATT CAGCATTGTC CCAGTACATG AG AT TAT G AC GACTTTCAAA ACCGGGTTTG 3300 

GAGAAATTAG AAATCTCATA ATGCTCAAAA CCCGCTCGCT CCAGCTCTGC AATGATGTAC 3 360 

TCAAACATCT CCGCTTCTAG TTCCTCCTTA GGCAGAGGCA ATTTCCCACG TCGCATCCGG 342 0 

TTCATAAAGA CCGTATGGTT TTCTAAAATC AAACTATACA AACTCATGTG GGGAATATCC 3480 

AATCCAATGG CTTTAGCCAC ATTTTCCTTT ACTTGCTCCA TGGTCTGACC AGG C AG AG C A 3540 
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TAAATCAAAT CAATGGAGAT ATTGTCAAAA CCAGCCAGTT TCAGGCGATC GATATTTTCA 3 600 

TAAATATCCT TCTCCAAATG ACTGCGCCCA ATCTTTTTCA ACATCTTATC ATCAAAGGTC 3 660 

TGGACACCTA GCGAAACACG ATTGACAGCC GAATTTTTCA AAACAGCTAT CTTATCCGCA 3 72 0 

TCCAAATCGC CTGGATTGGC TTCAATGGTC AACTCTTCCA AGACAGACAA ATCCAAGTTT 37 80 

TTAGTCAAGC CATTCAGTAA CACCTCCAGT TGCGGAGCCG ACAGGGCTGT CGgTGTTCCA 3 84 0 

CCACCGATAT AAAGGGTTGA CAACTTTTCA ATATCATAAG AACGAAACTC TTCCAGCAGA 3 9 00 

TGCTCTAAAT AGCTGTCGAC TGGCTGATTT TTGATGAAGA CCTTTGAAAA ATCACAATAA 3 960 

TAACAAATCT GGGTACAAAA TGGGATGTGC ACATAGGCTG ACGTTGGTTT TTTCTGCATA 402 0 

GTAATTATTA T AC C AC AAAG ACT AG AT T C C AGATAAAAAT CACCATCCCC AGATACATAG 4080 

TCCGTCCGGA GATGGTGATG GTTTATTCTT CTGTTATATC AATCACAATC TCTTCTGAGT 4140 

CATCAAGAGC TTCGGCTTTT TCTTGCCATT GCTCCTTGAG ATTATTTAAT TGATTTTTTG 4200 

ATGCTTCTGT CGCTTGAAAA GCATAGGATT TAGTTTGAGC AAGTATACTG TCCACAGTGA 4260 

TTTCACCTGA CTCAACCTGT TCTTTTGTTT T C AG AAC AAA ATCTGTAGCC TGCTCCTTAA 4320 

CTTCTGTCAG TTTTTCACAG ACTTGCTCCT TGGCATACTC CGGATCTTCT CTCAAATCAT 43 80 

CTAGAAAATC TTGAGCCTGA CTGCAAACTT GTTTGCCCTT ATCACTTGTT AAAAACAAGG 4440 

CAAGAGCTGC ACCTGAAACG GTTCCTAAAA GGATTGAGGA TAATTTACCC AT AAG G ATT C 4 500 

TCCTTTTTTA TTTTTTGAAA AATTTACTTG CAAGACGAAG AG C TG AC AG A CTTGCACCAG 4 5 60 

TCTTGAGTGT TTTTGAACCA GCTGATGAAG CTTTCTTGCT CAAGACACGC GCATGGTCAT 4 62 0 

TGAGGTCTGA AACAGATAGA GATAAATCTG CAACAGCACT GAAGAGTGGA TCAATCGTAG 4680 

CCACCTTGAC AT TGAT AT C A TCTGCCAAGA CATTGACCTT AGCCAACAAC TCATTGGTGT 4 740 

GATGCAAGGT CACATCCACA TCTGAAGTCA AGGTTTTAAT CGTCTTTTCT GTTTCATCGA 4 800 

TGACACGACC AAGCTTTTGT ACAGTAATGA TCAGATAGAC CAAAAAGACA ATCAAAGCTA 4 8 60 

GGGCAACAAG AATATATGCA ACTTCTAACA TTTAGTTTTC CTCCTCTGTA ATATAGTAAG 4 92 0 

GGGCCTTCTT TCGATTTTGA TAAATAACGA TCATTATACC GAGACCGATA AGG AC AAC TG 49 80 

AC AG C C AT TG GG AC AC TCG A AAGCCGAAGA ACATGAGACT ATCTGTTCGC ATACCTTCGA 5040 

TAACCATACG ACCGAAACCA TACCAAATCA AGTAAAAGGC CGTGATATGA CCTCGTCTGA 5100 

GACTCTTCCA TTTCCGTCTA AAAATCAGAA T C AAGGC AAA GCCAAGCAGA TTCCATAGAG 5160 

ACTCATAAAG GAAAGTCGGT TGACGGTAGC TCCCCTCAAT AT AC AT C TGG TCACGGATAA 522 0 

AGCCAGGTAG ATAATCCAGA TTATCCACTG TT G C AC CAT A AGCTTCTTGG TTAAAGAAAT 52 80 
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TACCCCAACG CCCCAAACTT TGAGCAATCA TAACGCTAGG CGCCGCAATA TCTAGAAAAT 534 0 

CCCAAGTATT GATGAGTTTA CGGTCAGCAA AGATATAGAG CACAAGAGCC CCAGTTATCA 54 00 

AACCACCGTA AATGGCCAAA CCACCATTCC AAATGGCAAA AATCTCTCCT AAATTCTGAC 5460 

TATAGTAATC AAATCGGAAA ATAACATAGT AGAGACGAGC TCCTAAAATA GCCAAGGGAA 5 52 0 

AGGCTACTAA GATAAAATCT AAAAT AT CGT CTGGTATGAT CTTCTTTCTA GGTGCTTCTT 55 80 

TCATGGTCAA AT AAACCG C A AGAATCAAGC CTGTCACAAT ACATAAGGCA TACCAACGAA 5640 

TGGCTAGGGG TCCTAGTTGA ATAGCAATTG GATCAAGCAT TTTGCACCTC ATTTCGAGCG 57 00 

ATTAGACTTG TCAGTCGTTC GTCGAACAAA CGGGTCGCAT CAAAGCCCAT TTCCTTGGCA 57 60 

CGATAATTCA TGGCAGCTGC CTCAATCACA ACAGAGATAT TACGACCTGT TTTAACTGGA 582 0 

ATACGAATAC G AGGAATG t A CGCCAGAAAC TTCAAGTTCC TCTGCATTAT TTCCAAGACG 5 8 80 

ATCAAAGGTC TTATGCGTAT CGTAATTTTC CAAATAGACA GCAAGCTGAA CCTGTGAAGA 5 940 

ATCCTTGACA GCACTCGCAC CGTAGAGACT CATAACATCG ATAATACCAA CCCCACGAAT 6000 

TTCAATCAAG TGTTTCAAAA TTTCAGCTGG TTCACCCCAG AGAGTAATCT CATCCTTGGC 6060 

AAAGATATCG ACACGGTCAT CGGCTACCAA ACGGTGACCA CGTTTGACAA GCTCAAGACC 612 0 

TGTCTCGCTC TTACCAATTC CACTATCTCC CTGAATCAAG ACGCCCATCC CATAAATATC 6180 

CATCAA 618 6 



(2) INFORMATION FOR SEQ ID NO: 13 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

GAAAAT C AC A ACCCTTTTTG CAAAATTTTT GAG AT T ATTT TCACAAACTT GATTTTTCAA 60 

AGTATACTCA ATAAAAATTA AAAAAATCCA CTACGTCAAG GCGAGGCTAA TGTGGTTTGA 12 0 

AGAAATTTTC GAAGAGCGTG AATGAGTATC ATCTATAGTA AAATAAAAAA ACTGAACAAT 180 

TTGGTTGGGG ACAGCCAAAC CAATTTCTCA CAATGTTTCA GAAACAAGGG TGTGCTATTC 2 40 

CAATTTCAGC CTACTATAAC TGTCATAGAT TGCTGAAACA AAGTCTAGGT AAAAGTCTTC 3 00 

ATAATAAAAA GACCTCCTAT CAAGTGTTCA AAAAC TTTG A TAGGAGGTCT TGTTTTGTGA 3 60 

AAATATTTAT CAAATTTTCT ATACAAGTGA GCTGTTAGCC AGGTTCTTTC TATTCTTTCA 42 0 

ATT TCAATGA ATGGATTTTT TACTAATACT CATAACTGGG AATTTGTCTG TGTAAAAATA 4 80 
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GCGAGATAGA TGGTATTTAT AAAACACTCA AGACAGCTAG ACTAATATCA TTTAAAACAT 54 0 

TATCTTCTTT TGAGCGACTG TTGGTTACCA ACATAGCTAA ATTTCCTGCA TTTTCAAATT 600 

GATAGGGTTC TGATTTAGCA TTCACAACCA CCAAGAGGTG TTCTTTGCCG TGAACTTCAT 6 60 

AGATAAGGTA GCCGCTATGT TCAATCGCAG AATG C AC AAA GACATGATGG T AAATTT C AT 72 0 

CATAGCTAGA GTAAGAAAAG GCACCAGTTT TTGTCTTCAA TCGGATGACT TGACGGATAA 7 80 

ACTCAATACT GTCTTGACGC T CAT T AATC A AGTTCCAGTT CACTTGGTTC ACACTGTCAG 84 0 

GAGCATTATA GCT AT T CATC GCACGCTCTC TATCATCATG GGTCAACTCA CCATTTTCAC 900 

CAGTCGCAAC CAGTTTGGTA CGACCAAATT CTTGACCGAT TTCCATAAAG GCCATCCCCT 9 60 

GCATGAGCAG ATTCATGGCT GTGGCAGTTT CGACCTTGCG CATGATTTGC TCTGAACTTT 102 0 

GGTCTGGATG AAGGGTTGCC AATAAATCGT GAAGATTGTA ATTGTCATGG GCTTCTACAT 1080 

AGTTAAGCAC CTGATTTGGA TGTGTATAGC TTCCTAATTC ACGACTTCCT AGGATTGCTT 1140 

TAGCTAGAAT TGGCTCTGTC GCAGCACCAC TGACAAAACC TGACTTGATA G C AC CAT AAA 12 00 

CTTCTCCCCC TTTGACAGCA TCGCGCTGAT TGTCATTAAA GAAACCAATA TTTGGCATCT 12 60 

GGTAGGCATT GTCCTTCTTG GCCTTATCAT AAGGGGCAAG ACCTGTTCCC ATATCCCATC 13 2 0 

CTTCTCCATA GAGGATAATG TTGGAGTCGA TTTCATCCAA GCTTTGACGA ATCATCTGCA 13 80 

TGGTCTTGAC ATCATGAATC C C CAT C AAGT CAAAACGGAA GCCGTCAATA TTATATTCCT 144 0 

GCACCCAGTA TAGAAGAGAA TCAAT CAT AT ACTTGCGAAA CATTTCGTGT TCACTGGCTG 1500 

TTTCATTTCC AACACCCGTT CCATTCTGGA AGGTACCATC TGGATTCATA CGATAATAGT 1560 

AATCAGGGAC TGTTGTTTGG AATGGTGCAT CAACAACTGA GAAGGTATGG TTATAGACTA 162 0 

CATCCATAAT GACTCCAATA CCCGCATCGT GATAAGCTTG AACCATCACC TTCAAATCAC 16 80 

GAATGACCTG AGCTGGATCA TCTGGATTAG TTGAAAAACT AGTTTCTGGC GCGTTATAGT 1740 

TTTGTGGATC AT AACC C C AG TTGTAGGTTA C ATT TC CATC CTCATCGTAT TCTTTATGAC 1800 

GGTCTGCAAT TGGTTGCAAT TGAACATAAT TGTAGCCCAG CTTCTTGATG TAATCAAAAG 18 60 

CAGTTGACTG GCCGTATTGG TTAACTGTTC CAGCCTGAGC AGCACCCAAG AAAGTTCCTC 192 0 

GAAGATGTTC ATCTACACCC GATGTAGGTG ATTTAGTCAA ATCACGAATG TGCATTTCAC 1980 

AGATAACTGC CTTACATGGA TTTTCCAAGC GCCAAGTAGC CTCCGAACCG TGCTTAACCT 2 040 

CGAAGTTTTC AACTTGCTTT TCTACATGGC TCAGAATAGC TGAACGTTTG C CAT C AGGG C 2100 

TGGTCGCGAT TGTATAAGGA TCACGTGTCA GTGTTTGGTG ATGAGGGAAT TGGACTTGAT 2160 

ACTGATAAGT CTTACCTACC AAATCTTCTT CAACATCCAA ACTCCAGACA CCGATTGTAT 222 0 
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TGTCCTTATG ATTATAAGAG TAGCTATTGC CTCTTTTCAT CTCAAAAGTC TTCCAAACGG 22 8 0 

GTGCATCATT AGCAGCTGAT TCATAAACGA CAACTTGCAC TTCTGTCGCT GTAGGTGACC 2 34 0 

AGAGAGAAAA ATGAGCCTGA TTGTCCTCTA CACGGCAACC CAATTCTCCT TGGTAACCCC 24 00 

AATGATGATC AAAAC TAG C A CTGTTAATGG CCTTATCAAA GGCAAAAGGA TTTTGATTTT 24 60 

TATAGAAAGG ACTGGCAATA GCAGGATTTT CAGAGTAATA AATCCTATCA TCGCCTTCCA 2 52 0 

AAATCCAGAC CTCTGTTAAT AGGGGATAGT GATTAAAACG GATAGAATAT TCTTTACTAG 2 580 

TTTGACCTGT ATGAACCACA AAATTCAAGC TTTCTATAAC ATGTGAACTT GGGTGTTCAA 2 64 0 

AGCTAAATAA AGCTCCAAAA TAATCTTCTT TGTAGGTTAG CAAATCAATT CGTTGATCCT 2 7 00 

GACTTTTTAC AAAGGAGCAA GTGT C AT ATT CTCCATTCTT ACGATGGTAA TGAATGCGCA 2 7 60 

TAGGGTAGTT ATACATTTTT TATTTTTCCT TTTTACTTTG TTTCTATTTC ACTAATAAAT 2 82 0 

TTTTGTCAAT CTCGTCTCAA TTAACAGACA TAGTCATATT CTCTAAACTC TGTTTTTAAA 2 8 80 

CG AT C C ATT A CAAACTTTCT AGCCATGCCT CATCTCTGAC CTGGATACCA AGTTCTTGTG 2 940 

CTTTTTGCAG TTTACTTCCA GCGTCTGCAC CTACCACGAC GAGGTCGGTC TTTTTAGAAA 3 000 

TACT AC CTGT CACTTTGGCA CCCAGACTTT CGAGTTTACT TTTAGCTTCT GAGCGCTTGA 3 060 

GTCGTTCCAA TTTTCCTGTC AATACCACGG TCAAACCTGA CAAGGCCGCA TCCGCTACTA 312 0 

CCGTCTGTCC TTTATAGTCC AGATTGACCC CAGTTTCTTT CAATTCTCTG AGCAGAATTT 3180 

CAGAGCCTTC TGTCGCAAAA TAAGTCTGAA GACTTTTGGC AATCACGCCA CCTAGACTTT 3 240 

C AAT ACT AG C CACTTCCTCT GAATCTGCCT GAGACAGATT TTCAATTGAA TGGAAATATT 3300 

GAAGTAAAAG CTGACTAACC TTGCTTCCGA CATGACGAAT TCCCAAACCA AATAAGAGCT 3 3 60 

TCTCGGCAGA ATTTTCCTTT GATGCTTGGA TAGCCTGATA CAGTTTAGCA GCGGACTTTT 3 42 0 

CCTTAACTCC CTCTAAAAGG AGGAAATCCT CTTCTTGCAA ACGATAAATA TCCGCCACAT 34 80 

CCTTGACTAA ATTAGCAGCA AAAAGCTTCT CAACAATAGA TGGACCAAGG C CTGT AAT AT 3 540 

TCATAGCATC ACGAGAAGCA AAGTGAATCA AGCCTTCCAT GATTTGAGCA GGGCAACGCG 3 600 

GATTGATACA ACGTAGGGCC ACTTCATCTT CAAAGTGCAA CAAGTCAGAG TTACAACTTG 3 660 

GACAGTTTGT AGGGATATCT AGTTTTTCTT CAGAAACCCG TTTGGACTCT ACCACACGTA 3720 

AAACGGCAGG GATGATGTCA CCAGCCTTAT ATACAATGAC CGTATCGTCT TTTCGGATAT 37 80 

CTTTTTCAGC AAT AT AAT C T ACATTGTGCA GGGTCGCACG GCTAACAGTC GTACCGGCAA 384 0 

GTTGTACTGG TGTTAGATTA GCAGTTGGAG TTACAACACC GGTACGGCCA ACTGTCCAGT 3 900 

CAACTGATAA GAGTTGAGCT TCTTTTTCTT CGGCAGGGAA CTTGTAGGCT ACTGCCCACT 3 9 60 

TTGGAGCCTT AACTGTAAAA CCAAGTTCTT CTTGACTTGC TAGGTCGTTG ACCTTGATTA 402 0 



WO 98/18931 



PCTYUS97/19588 



903 

CCACTCCATC AATATCGTAA GGCAGATTTT CCCGTTCCTG TCCTACTTCT TGGATAAAAT 4 080 

TCCAGATTTC ATCTATGTTT TCAGCCAAGA TTCGCTTAGG ATTGACCACA AAACCTAGTT 414 0 

GTTCTAGGTA CTTCAAACCC TTTTCTTGGC TATCACGAGT TGAAGGGCTG GCTTCTTGAT 4200 

AG AG AAAC GT TGCAAGATTA CGCTTGGCAA CTACTGCTGT ATCCAACTGA CGCAGAGTTC 42 60 

CTGCTGCCGC ATTACGAGGA TTAGCAAATT CAGGCTCTCC ATTTTCTTGG CGCGCTTGGT 43 2 0 

TAACTTGGTC AAAGGAAGCG CGTGGCATGT AACATTCCCC ACGAACTGTG ATATCTAGTT 43 80 

CTTCTGGCAA AGTCAAAGGG ATGTCCTTAA CACGCTTGAG GTTTTCTGTG ATATTTTCAC 4440 

CAATTGAACC ATCTCCACGT GT T AC C C C AG CAACCAAAAT CCCCTTTTCA TAAGTCAGCG 4500 

AGATAGATAA GCCATCGATT TTCAGCTCAC AAATATAGGT CGGATGAGCC ACTTCCTTAC 4560 

GAACACGCGC ATCAAAAGCA TCTAGCTCCT CACATGAAAA AGCATCCTGC AAACTATAAA 462 0 

G AGG AT ACT G ATGACTGTAT TTTTCAAAAC CATCTAAAAC CTTGCCACCA ACACGATGAG 4 680 

TCGGACTGTC TGCTAGCACT TGCTCTGGAT AAGCAGTTTC TAACTCGACC AACTCACGGT 4740 

AAAGGCGGTC ATACTCACTG TCTGAAACCG AGGG ATT AT C GCTGGTATAG TACTCAGTCG 4 800 

CATAGCGATT GAGCAAAGCG ACTAACTCAT TCATTCTTTT ATTCATAAGA CCATTTTACC 4 8 60 

ATAAAACAAG CCCTCCTCAC AAACGAGAAG GGCGGAAAAA ACACTTAGTT TGAAATTATT 4 920 

TTTGAAACTC AAGCAACCTT ATATCAATTT TTCAAAATGA GTTCGAACAT ATCCGAGAGC 4980 

TAAGAAATAT AAGGCTACAA CTCCAAGTCC AATAATCAAG AAAGAATAAA G AT GG AC AC T 504 0 

TGGCAAGACT GTCATAAATC CTTTTGCAAT AGGCATAAAT AGAATAGCTA AGGTAAAAAT 5100 

TGTACTCAGT ACTCTTCCAA GAAATTCGCT CTCAACCTTG GTTTGTACTT GAGTAAAAAA 516 0 

GTGAATATTA AAAATCGTCA TAAACAATTC ACAAACTAAA TTTCCAGAAA AGGAAAGAAA 522 0 

AGTTGGAAGT GGTAATCCCA T C AT AAAAAC TCCGACACCT GTCAAAGCCA GTAAAATCAA 52 80 

AAGATTATAA ATATTAGCTT TAATTTTACT AGCTAGAAGA GCCCCAATGA TGGAACCAAT 53 4 0 

AGCCCCCATA GTTAAAATAC TTGCATAGGC TCCTTCTGAC CCGTAAAGCT GATTCGAAAA 5400 

GGGAAGTAGA AATTCAAAAG CTGCAAAAAA GAAATTAACG CTGGAAGCTA CCAGCAAAAG 54 60 

GAAGAAAATT TCTTGCTGAT GCCAGATATA GTGTAACCCA TCCTTGATAT C T AC AAAAAT 5520 

ATCTCTCCCA GTAAAAGCCT TTTTCTCTTG AACTTTTGCT TCCTCTTTTG GAAGGAAAGC 5580 

CACTAGAACA AAAGCAATGA AAAAAGTCAG CGAGTCTAGC AGTAGCGTCA TAT G GAG ACT 5 640 

TGCAAACTGT AAAACAAGGA AGGAAAGAAC AGGAGAGCTA ACACCTACAA CCTGCAAAAC 5700 

CAGCTCTAAG CGAGAATTAT AG AT C AC AAT CTCATCTTTC TCCACCACTT CAGTTATGAT 5760 
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AGCTTTATTG GCTGTGCGAG AAAAGGCAAA AGCAATAGCC TGCACAATGT TAG C AAC AAT 582 0 

CAAAGCGCCA ATCATCCAGC TATCATTCCT TATGAAAGAA AT AG CC AG AC AAAGAATCCC 5880 

ACAAACAAGA TCTGCCGTCA TTAAAATCTT ACGACGAGAA AAACGGTCTG AAATAACTCC 594 0 

G CC AAAGGG A TTGACGAGAA TAGATGTGAC GAGCTCAGAA ATCTGATACA TTCCTAAAAC 6 000 

TGTCTGTCCT ATAGTCCCCA TAGAAGCCAA C C AG AC ACT A TTTCCATAAT CATAGAGCAT 6060 

ATTTCCCATT TTATTGATAG CCCCACGGCT AATCAACTGC ACTGCATAGC GATTCATATT 6120 

AAAGCTCCTC TCAAATTTTG AAACTATTGT ATCAAAACCG AAAGGAGCTT TTTATTTTTT 6180 

CCCTTATTTG GGAAAATTAA CTTTTGACAA ATTTTTCGTA GTGTTCCTGA TAATAGGCTA 624 0 

CTTGCTCTGG AAGACCTAAC ACATCAAAAA TATGCATGGC CTCTTGCATC TGCTTACAGC 63 00 

CTTCTTTACA CTGTCCTTTT TGATATAAGG CAAAACCTTT TAAATAATGG AAAACATTAC 63 60 

GCTCATAAAG CTTAATACCT TTGTCAATAA TCTTCTCTGT ATAAGCCTCA AAATAGTTGG 642 0 

C AT T AT AAAA AGAAGAATGC TCTAAACAAT GCTGGTAACA ATTGAGGGCC AAAATCAACA 64 8 0 

CTAATCTCTT ATGGCGACTA ATCTCTTGGT AAAATTCCTC CCTCTCCATA ACTTCTCTAC 6 540 

CAATCCGAGT GACATAGTCT ACATCGTAGA AACTATAGAG GTTACCGAAA AGAATCAACT 6 600 

CATACATGGT CCATTCTTCT GTTTTGAAGA GATAATCTGC TACCTTACCC AAATCATCCT 6 6 60 

GCTTCATATC ATAACTCGCA TCTCTTTGAC AAATCAGACC TTGTAGCAAA ATCCAGTTCA 6720 

GCTCAAAATA AAGGGGAGTC GTCGAACTCT TAGACTTTTC AAGTTGTTCT CTTTGAAGCT 67 80 

TTTGAAAACC TGCAATATCG TTTGAATAGT AAAGTGGGAT AATCTGTGCC ATCATAGACA 6 840 

CATGTTCATG ATTATGAAAA TTCCTTGCCT TATCCATGAA ATTTTCGATT GTTACATGAA 6900 

TGTTATCCAA AATCTCAAAG AAACGGGAGA CTGCCAGGTC AGACTCCCCA AGCTCAAAGC 69 60 

GAGATAACTG AGAGGTAGAG CAGGATTCGC CTGCTGCTTC CTTTAAAGAA TAATTTCCAC 7 02 0 

TTGTTCGAAA TTCACGAAAT ACTTTTCCAA G ATGTT C CAT CTTTACACCT GCTCTGATAA 7 080 

TTCTTCCCAC TCAAGCATAG CTTCTTCCTG ACGATGGCTG ATTTTGTCCA GCTCAGCCTG 7140 

TAATTCCATG AGTTTGTCGG CATCGTTTGT TTCCAACATT TGTTCAGAAA TGGCTTGGCT 72 00 

TTGACTTTCT AGCTCTTCAA TTTCAGCTTC TAGACTTTCG ATTTGTCGCA TGAGTTTGCG 72 60 

AACTTCTTTT TGACTTTCTT TCTGGGCCTG ATAGTCATTG ACTGGACTTG CTTCCTTTGC 7320 

TTGATTGCTA GTTGAAGCTT CCTCAGTCTG ACTCATTTCT GCTGTTGCTT TCTTCTCAAC 7 3 80 

ATAGTAGTCG TAATCTCCAA GGTAGAGAGT TGAACCATTC TCAGACAATT CCAAAACATG 7440 

AGTTGCCACA CGATTGATAA AGTAACGATC AT G ACTG AC A AACAGCAAGG TTCCATCAAA 7500 

GTCAATCAAG GCATTTTCTA GCACTTCCTT ACTATCAATA TCCAAGTGGT TGGTCGGCTC 75 60 
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ATCCAGAATC AAAAAGT T AT TGTTTTCCAT AGACAATTTA GCTAAAAGCA AACGAGCTTT 7 62 0 

TTCGCCACCA GAT AG CAT GC CGACTGATTT TTTAACATCA TCTCCTGAGA AAAGGAAGGC 7 6 80 

TCCAAGACGG TTGCGGATTT CAACTTCTGG TGTCAGTTTG AAATCATTCC AGAGTTCATC 7740 

CAGCACCGTA TTACTTGGTG TCAGCTTGCT TTGGGTTTGG TCATAGTAAC CAACCTCAAC 7 800 

ATT AGCGC C A AAGCGCTTTT CTCCCTTGAT AAAAGGAATC TGGTCCACAA TAGACTTGAT 78 60 

AAAGGTTGAC TTGCCGATAC CATTTGGACC AACGATAGCG ACAGCATTCA TCTTACGAAG 7 92 0 

ATCTAGGTTA ATCGGTTGTG ACAAGACTTC CCCGTCATAG CCAACAGCTG CATTTTCAAC 7 9 80 

AGTCAAAACA ACATTGCCCG ACGTTTTTTC AGACTGGAAG GTCATGTTGG CTGATTTCTT 804 0 

GCCAGCTTCA GGCTTGTCCA AACGTTCCAT TTTTTCCAGT TGTTTACGGC GAGATTGAGC 8100 

ACGTTTAGTC GTTGAAGCAC GAACTAGATT GCGATTGACA AAGTCTTCCA GAGCAGCGAT 8160 

TTCCTTCTGT TGCTTTTCAT AGTTTTTTGC CTCAGTAACT AGCTTTTGCT CCTTCAATTC 822 0 

GACAAAACGA GAGTAATTCC CCACATAGCG ATCCAAGGAA TGCTTGGTCA AATCTAGCGT 82 8 0 

AATTGTCGCA ACCTTGTCCA AGAAATAACG GTCGTGGCTG AC G AT AATG A GGGCACCGCT 834 0 

ATAGTTTACC AAGTAATTCT CTAGCCAGGC GATGGTTTCA ATATCCAAGT GGTTAGTTGG 84 00 

CTCGTCCAAG ACCAAGAGAT TGGGCTTTTC AAGG AG CAT T TTGGCAAGTG CCAAACGAGT 84 60 

ATTTTGACCA CCAGAAAGCT CAGCAATTTT CATCTGCCAC ATAGACTCGT CAAACTTGAA 852 0 

TCCATTCAAA ATCGCTCGAA TATCAGCTTC ATAGGTAAAG CCACCTGCTT GGCGAAAATT 8 580 

CTCAGATAAG CGGTCATAAT C TG AC AT C AG TTTATCCAAA TCCTCACCAG ACTTTTCACC 864 0 

CATCTCCAGC TCCATCTGAC GCAGTTGTCT CTCCGTCCGA CGCAAATCAT TAAAGACATG 87 00 

AAGCATTTCA TCGTAGATGG TATTTTCAGA CTCAAAACGG CTATCTTGGG CTAGGTAAGA 87 60 

CAGAGAAATA TCTTTTTTCT TATTGATTTC TCCGCTAGTT GGCTCCTCTT CTCCAACTAA 882 0 

AATCTTCAAA AGAGTAGACT TACCTGCACC ATTTTTCCCA ACAAGAGCAA TCCGATCTCG 888 0 

TTCATCAACC TGCAGGTTGA TATTATCGAA AAGAACCTCT CCTGCAAAAG AACGTTCAAT 894 0 

TTTATTAGCT TGTAAAATAA TCATACAAGT AGTATAGCAT GTTTCCCTAA GGCATTCAAG 9000 

ATAATCGTAA GTCTTTTAGT ACAACTTTTA TAACATAAAA TAAACTAAAT T ATGT AT AT T 9060 

T TAT ATT AG A TTACTTCACT ATCTTGTTGG ATTTTCTAAC CAGCTAATCT TGTTTCAAAT 9120 

AGTTATCGCA CAAGTCTATT ATTTAATTCT TTTCATCATT TACGTACGTA TAGCAGATTG 9180 

AAATAAGATG AGAACAAATC GATTGGGAAA GTAAAATTAA TTTCTATAAA TGTTTTAGCA 9240 

ATTGTTTCGT ACTATTTTAG ATTCAGTCTA CTATATACAA TATTTTCGGA ACATTCAACT 9 3 00 
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TTTTAACTCT AT T T ATT AC T AGATTTCATA ATTAAAAAAC CTACTGACCA AG C T AG AAAG 



9360 



CTTGATACAA TAGGCTTTTT AAAG AC T GAT TATTTAACAG CGTCTTTAAG AGCTTTACCA 



9420 



GCTTTGAATG CTGGTACTTT AGAAGCTGCA ATTGTCATTT CTTTACCAGT TTGTGGGTTG 



9480 



CGACCTTTAC GTTCTGCGCG CTCACGAACT TCAAAGTTAC CAAAACCGAT CAATTGAACT 



9540 



T 



9541 



(2) INFORMATION FOR SEQ ID NO : 13 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 02 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

TTGACTATCC TATCATGCTT TCTAAGGTCT ACTCAAGAAA ATCATTTTCA AGTTTTCACA 60 

CCTTTCTCAA AAAAGTTAAA AAATTTTCTC AAAAACGCTT GACTCTGACC TAAGGCGAAG 12 0 

GGTTATACTA TCATTGTAAG GAGGAAATCA TGTACCATAT AAAAGAAGCT GCGCAGCTTT 180 

CAGGTGTCTC TGTCAAGACC CTGCATCACT ATGACAAGAT AGGACTCTTG GTCCCCTTAA 240 

AGTCGGAAAA CGGCTATCGA ACCTACAGTC AAGAGGATTT GGAACGCCTT CAGGTCATTC 3 00 

TTTACTACAA ATATCTAGGC TTTTCTTTAG AGAAAATAGC AGAGCTGTTA AAGGAAGAAA 3 60 

GGACAGATTT ATTGCCCCAT TTGACTAGGC AGTTGGACTA TCTAACTCGC GAAAGGCAAC 42 0 

ATCTGGATAC CTTGATTTCC ACCTTGCAAA AAACTATTCA AGAACAAAAA GGAGAAAGAA 480 

AAATG AC CAT TGAGGAAAAA TTCACGGGAT TTAGCTATCA AGACAATCAA AAAT AC C AC C 54 0 

AAGAAGCGGT AGAGAAATAT GGTCAAGAAG TCATGGGACA AGCGCTCGAA CGCCAAAAAG 600 

GTCACGAAGA CGAGGCTACG GCCGCCTTTA ACCAAGTCTT TCAAACTTTG GCACAAAATC 660 

TTCAAGTTGG TTTACCTGCA ACAGCAACCG AAAACCAGGA GCAAGCAGCC AAGCTCTTGC 72 0 

AAGCCATTCG C ACT T ATGG A TTTGACTGCT CTATTGAGGT ATTCGGTCAT ATCGGTAAAG 7 80 

GTTACGTCTA CAACCCAGAG TTTAAGGAAA AC AT TG AC AA GTTTGGTTCT GAAACAGCCC 84 0 

AGTACACGTC AGATGCCATT GCGGTTTACG TTCAGACAAA TGCAGAATAA ATAGGCTAGG 900 

AATTTCCTAG CCTATTTTTT ACTTCAAATC ATAAAGCCAG TCGTCACCGT TTTTGTAGTA 960 

AAAGAATTCA CTGAGATCTT C TT CTAGAAA CACACGAAGC ATATCAGACA TATCATCGGT 102 0 

TGCAAGTTTT AGATGAGAAA GATTTTCAAA GTCCTCCCAC CAAACTTTCC CTTCGTCTGA 1080 

AGACTGGAGT TCACCAGTAA AGTGTTCTGT CTTGTAAAAA AGGACGACAT AACGATAATC 1140 



WO 98/18931 



PCT/US97/19588 



907 

CTTGTCGTCA TACCAGTTTT T GAT AC C AC A GAGTTGGGGT TTGGAAATGA TCAGACCAGT 12 00 

TTCTTCTTTC ACTTCACGAA TG AC AG CATC GACAAAGGAT TCGCCACGTT CAACATGACC 12 60 

ACCAGGAAAA GTAATGCCAG ACCAGTCGGG ATTAACTCGG TCTTGGACCA GGACCTTATC 132 0 

TCCGTTTTTA ATCATACACA TGTTAACAAA TTCGACTGCC TCTCTTCTGT TCATTCTTCA 13 80 

CAACCTTTAA TCTTTAATCA TAATGCAGAC TTCCCGCCAC CCAGCCGGTA C AG AGGG C AG 1440 

AAGTGATGTT AAAGCCACCC GTGTGGGCAT TGATATCCAT AACTTCGCCT GCAAAGTGGA 1500 

GGCCAGGTAC CAGCTTACTT TCAAGGGTTT TAGGATTGAT TTCCTTGAGA CTGACTCCAC 1560 

CCTTGGTAAC AAAGGACTTT GCAAGGGACA TTTTTCCAGT TACAGGAATT TTAAGTTCTT 1620 

TAATGGACTG GACAAGTTGT TCTCGTTCCT TTTCAGTCAG TTGTTTGACT TTTTCAGGAT 1680 

ATCCTTGTAC AAAAAATTCG GCCAAGCGTT CTGGTAACAA GGTTTTTAAA GCGTTTTTCA 174 0 

AGGATTTTTC CCGATTTTCT TCTAGAAATG TAACCAAGTC CTTCTCAGAA AGTTGAGGCA 1800 

AAACATCGAG TG AG AG AAC C TCCCCACCTT TGACAAAGCT AG AC ATG CGT AGGGCAGCAG 18 60 

GACCTGACAA ACCAAAGTGG GTAAAGAGTA AATCATGAGT GATGACATGC TTACCATAAC 192 0 

TTAGGGTCAC ATCGTCCAGA GAAATACCTT GTAAGGCTTT ATGTGGAAAA TCTGTTAATA 1980 

AAGGACTTTC AGCAGCCTCA AGATCGGTGA TGGTATGCTT AAAATGGCGA GCAATCTCGT 2 040 

GACCAAAACC AGTCGAACCA GTCGAAGGAT AAGACTTACC ACCTGTTGTG ACAATGAGTT 2100 

TCTCACAAGT GAAGGTTTGA TCCGCTGACT TAAGGACAAA CTGGTCATCT ACTTTTTTAA 2160 

CAGAAACGAT TTCTATTTGA GTAGCAACTT GACCACCTAG TTCGGTGATT TTCTTTTCCA 222 0 

AAGCTTCGAT AATAGTCCGA GACTTGTCAC TGGCTGGAAA GACGCGTCCG TGGTCTTCGA 22 80 

CCTTAAGTTT AAC AC CAT T T TCTGTAAAAA AGTTGATGAT GTCATGATTA TCGAACTGGG 2 340 

AGAAAACACT GTAAAGAAAG CGTCCGTTTC CAGGAATTCC AGCTAGCAGG TTGTCTAAGC 2 400 

TACCATTGTT GGTCACATTG CAACGTCCCC CACCAGTCCC AGCTAATTTT TTTCCAAGTT 2 4 60 

TCCGATTTTT TTCGATGAGG AGGGTTTTCT GTCCATAAAA GCTACTGGAA ATCGTAGCCA 2520 

TCATACCAGC AGGTCCCCCA CCGATGACAA T AGT AT C AAA ATGTTTCATA GCTCTATTGT 2 580 

AC C AC AAAAA AACAAGAGAT G ATGGTC AC C TCTTGTCAAG AATGCAATTA ATCAATTTCA 2 640 

TAGCCCATCA GCAAACCGCC CTCTTCTGCA TAGAAACTGC AGAGACCAGA GGTTGGTAGA 27 00 

ATTTTAATAT CCGCTTGTGG GAAGGTTTCA CGGATTCGCT CTGAGAGCTG TT G AC AAC AT 2760 

TTTTCGTTAT TGCGTTGGGC CATGACAATA CGGCCACCAG CAT AT C C AG C TTTTACTAAC 2 82 0 

T CAT C AT AGG CAGCTTGAAC TGATTTCTTT GATCCCCTTG CTTTTTGTAG CAATTCGAGA 2 880 
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GTCCCAGTTT C ACT AG CTT T TCCGACCATA CGAATGTTGA GAAGGCCAAC GACCGTACCG 2 94 0 

ATAAGCTTGC TCAAACGGCC GTTCTTCACC AAGTTATCGA CTTTGGCTAG G AC AAAG AG C 3000 

AACTTAGTTT TTTCTTGATA GGCGGTGATA GCTTCAACCA CTTCTTCAAA AGACAAGCCC 3060 

TGGTCAATCA AGTCATTCAA TTTTTCTACG AGTAGGTCAA CTTCACCACC AG C AG AT AAA 312 0 

CTATCAATCA CATGAATCTT AGTGTCAGGA TGGTCTTCCA GATAAATATT CTTTGCTAGT 3180 

TGAGCACTAT TGTGACTGCC AGAAAGGGTA CCTGTGATGG TTACTAGGAA AATGTTTTTG 32 40 

GCACCTTCAA ATGCTCGCAA ATAGTCATCT GGGCTTGGAC AAGCCGATTT TGAAGCTTCT 3 3 00 

GCAGTTGCAT ACATGGTTTC CATCATTTGG TCAATATCGA GACTGGCGTC AT C AAC AAAG 33 60 

ACCTGATCAG CTACTTGAAT GGTTAAGGGG ACACTTACAA AGGTTGTGTT AATAGCTGGT 342 0 

GTTGGCAGTT GACGATAATC ACAACCAGAG TCAGCAATAA TCTTCCAAGT CATAGAAATT 34 80 

CTCCATCTTT GTCAGGAACG AT 3502 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 134: 

CGATTGATTT TTTTAAAGCG TT CG AT AG AG AATGAGAAAC GAATCCTTAG CAATGGCGGG 60 

AAAGAATTTG GAGTTGAGAA TACAAAACGA TTAACTATGG CTCATATTGT TTTTTATCTC 12 0 

TCTTGCTTGG TTGAGGCAAT GGTGCACAAG ACAATTTTTG ATGGCATGGG CATGGTTGGT 180 

TTAGTCTTGC TTATTTTTTC TATGCTGATG TTGATGTTGG TGATTCACTT GTTGGGAGAT 240 

ATTTGGACAG TGAAGCTTAT GCTTGTCAAT AATCACAAAT ATGTAGATCA TATCTTGTTT 3 00 

AGGACAGTAA AACACCCTAA TTACTTTTTA AATATTCTTC CTGAGTTGAT TGGCTTGACC 3 60 

TTGTTGAGTC ATGCTTATGT GACTTTTGTT TTAGTTTTTC CAGTTTATGC AGTTATTTTG 42 0 

TATCGACGAA TAGCTGAAGA GGAAAAGCTA TTACATGAAG TTATAATCCC AAATGGAAGC 4 80 

ATAAAGAGAT AAATACAAAA TTCGATTTAT ATACAGTTCA TATTGAAGTG ATATAGTAAG 54 0 

GTTAAAGAAA AAATATAGAA GGAAATAAAC ATGTTTGCAT CAAAAAGCGA AAGAAAAGTA 6 00 

CATTATTCAA TTCGTAAATT TAGTGTTGGA GTAGCTAGTG TAGTTGTTGC CAGTCTTGTT 660 

ATGGGAAGTG TGGTTCATGC GACAGAGAAC GAGGGAGCTA CCCAAGTACC CACTTCTTCT 720 

AATAGGGCAA ATGAAAGTCA GGCAGAACAA GGAGAACAAC CTAAAAAACT CG ATT C AG AA 7 80 
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CGAGATAAGG CAAGGAAAGA GGTCGAGGAA TATGTAAAAA AAATAGTGGG TGAGAGCTAT 840 

GCAAAATCAA CTAAAAAGCG ACATACAATT ACTGTAGCTC TAGTTAACGA GTTGAACAAC 900 

ATTAAGAACG AGTATTTGAA TAAAATAGTT GAATCAACCT CAGAAAGCCA ACTACAGATA 960 

CTGATGATGG AGAGTCGATC AAAAGTAGAT GAAGCTGTGT CTAAGTTTGA AAAGGACTCA 102 0 

TCTTCTTCGT CAAGTTCAGA CTCTTCCACT AAACCGGAAG CTTCAGATAC AGCGAAGCCA 1080 

AACAAGCCGA CAGAACCAGG AGAAAAGGTA GCAGAAGCTA AGAAGAAGGT TGAAGAAGCT 1140 

GAGAAAAAAG CCAAGGATCA AAAAGAAGAA GATCGTCGTA ACTACCCAAC C ATT AC T T AC 12 00 

AAAACGCTTG AACTTGAAAT TGCTGAGTCC GATGTGGAAG TTAAAAAAGC GGAGCTTGAA 12 60 

CTAGTAAAAG TGAAAGCTAA CGAACCTCGA GACGAGCAAA AAATTAAGCA AGCAGAAGCG 132 0 

GAAGTTGAGA GTAAACAAGC TGAGGCTACA AGGTTAAAAA AAATCAAGAC AGATCGTGAA 13 8 0 

GAAGCAGAAG AAGAAGCTAA ACGAAGAGCA GATGCTAAAG AGCAAGGTAA ACCAAAGGGG 144 0 

CGGGCAAAAC GAGGAGTTCC TGGAGAGCTA GCAACACCTG ATAAAAAAGA AAATGATGCG 15 0 0 

AAGTCTTCAG ATTCTAGCGT AGGTGAAGAA ACTCTTCCAA GCCCATCCCT GAAACCAGAA 15 60 

AAAAAGGTAG CAGAAGCTGA GAAGAAGGTT GAAGAAGCTA AGAAAAAAGC CGAGGATCAA 162 0 

AAAGAAGAAG ATCGCCGTAA CTACCCAACC AATACTTACA AAACGCTTGA ACTTGAAATT 1680 

GCTGAGTCCG ATGTGGAAGT TAAAAAAGCG GAGCTTGAAC TAGTAAAAGA GGAAGCTAAG 1740 

GAACCTCGAA ACGAGGAAAA AGTTAAGCAA GCAAAAGCGG AAG TTG AG AG TAAAAAAGCT 1800 

GAGGCTACAA GGTTAGAAAA AATCAAGACA GATCGTAAAA AAGCAGAAGA AGAAGCTAAA 1860 

CGAAAAGCAG CAGAAGAAGA TAAAGTTAAA G AAAAAC C AG CTGAACAACC ACAACCAGCG 192 0 

CCGGCTCCAA AAGCAGAAAA ACCAGCTCCA GCTCCAAAAC CAGAGAATCC AGCTGAACAA 1980 

CCAAAAGCAG AAAAAC C AG C TGATCAACAA GCTGAAGAAG ACTATGCTCG T AG AT C AG AA 2040 

GAAGAATATA ATCGCTTGAC TCAACAGCAA CCGCCAAAAA CTG AAAAAC C AGCACAACCA 2100 

TCTACTCCAA AAACAGGCTG GAAACAAGAA AACGGTATGT GGTACTTCTA CAATACTGAT 2160 

GGTTCAATGG CGACAGGATG GCTCCAAAAC AATGGCTCAT GGTACTACCT CAACAGCAAT 222 0 

GGCGCTATGG CGACAGGATG GCTCCAAAAC AATGGTTCAT GGT ACT AT C T AAACGCTAAT 2280 

GGTTCAATGG CAACAGGATG GCTCCAAAAC AATGGTTCAT GGTACTACCT AAACGCTAAT 2 34 0 

GGTTCAATGG CGACAGGATG GCTCCAATAC AATGGCTCAT GGTACTACCT AAACGCTAAT 2400 

GGTTCAATGG CGACAGGATG GCTCCAATAC AATGGCTCAT GGTACTACCT AAACGCTAAT 246 0 

GGTGATATGG CGACAGGTTG GGT GAAAGAT GG AG AT AC CT GGT ACT AT C T TGAAGCATCA 2 520 
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GGTGCTATGA AAGCAAGCCA ATGGTTCAAA GTATCAGATA AATGGTACTA TGTCAATGGC 2580 

TCAGGTGCCC TTGCAGTCAA CACAACTGTA GATGGCTATG GAGTCAATGC CAATGGTGAA 2 64 0 

TGGGTAAACT AAACCTAATA TAACTAGTTA AT AC TG AC TT CCTGTAAGAA CTCTTTAAAG 27 00 

T ATT C C C T AC AAATACCATA TCCTTTCAGT AGATAATATA CCCTTGTAGG AAG T T TAG AT 27 6 0 

TAAAAAATAA CTCTGTAATC TCTAGCCGGA TT T AT AG CG C TAGAGACTAC GGAGTTTTTT 2 82 0 

TGATGAGGAA AGAATGGCGG CATTCAAGAG GCTCTTTAAG AGAGTTACGG GTTTTAAACT 2 8 80 

ATTAAGCCTT CTCCAATTGC AAGAGGGTTT CAATCTCTGC CAGGGTGCTG GCTTGCGAAA 2 94 0 

TGGCTCCACG GAGTTTGGCA GCGCCAGATG TTCCACGGAG AT AG TG AGG A GC G AG AC C GC 3 000 

GGAATTCACG AACTGCGACG TTTTCTCCTT TGAGGTTAAT CAATCGTTTC AAGTGTTCGT 3 06 0 

AGGCGATCTT CATCTTGTCT TCAAAGGTCA AATCAGGTAG GATTTCTCCT GTTTCAAAGT 312 0 

AATGGTTGAT TTGGTTGAAG AGGTAAGGAT TTCCCATGGC AGCTCGGCCA ATCATGACTG 3180 

CGTCAGCACC AACTTCTTCG ATGCGTTGCT TGGCTTCTTG GACAGTACGG AT AT C AC CGT 32 4 0 

TGGCGATGAA TGGAATCTTG GTTAGAGCTT GGGCAACCTT GTAAAGGGTC TCAAGGTCTG 3 3 00 

CGTGGCCAGT ATACATTTGT TCACGGGTAC GGCCATGCAT GGCGAGGGCA GAAACACCTG 3 3 60 

CAGCTTCAGC AGCGAGAGCA TTTTCTACTG C AAG AG AT GG GTCCGCCCAG CCGGTACGCA 3 42 0 

TTTTGACAGT AAGT GGG AT A T C AAGG AC AG ACTGGACCTT GTTGATGATG GAGTAAATCT 3 4 80 

TGTCTGGATC CTTGAGCCAC AT AGC AC C AG CTTCGTTCTT CACGATTTTG TTGACAGGGC 3 54 0 

AGCCCATGTT GATATCGACG ATATCGGTCT TGGTGTTTTC TTGGATGAAT TCTGCTGCGC 3 600 

GTGCTAGGCT GTCTTCATCG CTACCAAAAA GTTGGATAGA GACAGGGTTT TCGCCCTCAT 3 6 60 

CGATATGAAG CATGTGCAGG GTTTTTTCGT TGTTGTATTG GATTCCCTTG T C AG AG ACC A 3720 

TTTCCATTAC AACGAGTCCA GCTCCGAGCT CCTTTGCGAT AGTACGAAAG GCTGAGTTGG 3780 

TCACGCCAGC CATAGGCGCT AAAACGGTAC GATTGGGAAT CTCAATATTG CCAATCATAA 3 84 0 

AAGGTGTATT AAGATTTGTC ACGAATGAGT TCCTCCAGGT CCTTTTCATC AAAGTTGTAA 3 9 00 

GTAGTTTGGC AGAATTGACA AGTGATTTCT GCCCCGTGGT CTTCCTCTTT CATTTCCTGT 3 960 

AAGT C TG AGC TTGGAAGGCT GGCAAGAGCG TTCATAAAGC GTTCATGGCT ACAGTCACAT 4 02 0 

TGGAAACGGA TTTCTTCTTC AGAAAGACGC TTGTAGGCTT CGTCCCCGTA GATAGCCTTG 4 080 

AGGAGGGCTT CGATATGGTC GTCGCTTTCG AGAAGAGTAG AG AT AG C TGG CATTTCTTGG 414 0 

ATGCGTTTTT C AAAGCG AG C AATCTCTTCT TTCTTGGCTC CTGGCAAGAC TTGAACTAGG 4200 

AAACCACCTG CAACCTTGAC CTTGTCTTCC TCGTCCAAAA GGACATTGAG GCCGACCGCT 42 60 

GAAGGCGTTT GTTGGCTTTC AGTAAGGTAA AAGGCAAGGT CTTCACCGAT T T C TC C AG AG 432 0 
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AT GAGGGG AG TTATAGAGTT GTAAGGATTT CCAGTACCGT AGTCTGTGAT AACGAGGAAT 4380 

TGACCATTTC CAACAAAAGG TCCGACTAGG ACTTCACCAG TCGCAGTCTT TTTGATGTCA 444 0 

ACACCAGGAT TTTGAACATA GCCTTTGACG TTCCCCTTGG TATCAGCGAC GGTGATAATA 4500 

GCACCTAGAG AGCTAGATCC CAACACCTTA ACTGTAAGTT TGGTATTTCC TTTTTCATTG 4560 

GCTGCGAGAA TCTGGCTAGC GATAAGAGTT CGACCAAGCG CTACAGTTGA GCTAGCTTGG 4 62 0 

GTTTGATGTT TTTCTTGAGC AGTGCGGACG GTTTCAGTGC TATCAAGGAC AAAAGCACGA 4680 

AAGGcTCCGC TTTCTGATAT AGTTTTAATA ATTTTATCCA TAGCTACTAT TTTAGCATAA 474 0 

AAATGCCCAA AGGGGGAGCC GTGTGTTTAC TGATTTTCAG GATAATGGAC C AGGAAAT C A 4800 

GCATGAAAAT AAAAAGAGAA ACAGATTATT TTAGCATTTG TCAGATTTAT GCTATGCTTA 4860 

AGGTAGAAAA TGAAAGGGAT AACAAATGTA TTTAGGAGAT TTGATGGAGA AAGCCGAGTG 492 0 

TGGTCAATTT TCAATACTTT CCTTTCTATT ACAAGAGTCT CAGACGACCG TCAAGGCTGT 4 980 

AATGGAAGAA ACAGGATTTT CAAAAGCAAC CCTAACCAAA TATGTCACCC TGCTCAATGA 504 0 

CAAGGCTTTG GATAGTGGCT TAGAGCTGGC TATTCACTCA GAAGATGAAA ATCTGCGTCT 5100 

GTCTATCGGT GCAGCTACCA AGGGGAGAGA TATTCGGAGC TTGTTTTTGG AGAGTGCTGT 5160 

TAAATACCAG ATTTTGGTTT ATCTTCTCTA CCACCAACAG TTTTTAGCCC ATCAGCTGGC 522 0 

TCAAGAATTG GTGATTAGCG AGGCTACGCT TGGTCGTCAC TTGGCTGGTT TAAATCAGAT 5280 

TTTGTCAGAA TTTGATTTAT C CAT C C AAAA TGGCCGTTGG CGAGGTCCAG AG C AT C AG AT 5340 

TCACTATTTC TATTTCTGTC TTTTCCGAAA GGTCTGGTCG AGTCAGGAAT GGGAAGGTCA 5400 

CAT G C AG AAA C C AG AG AG AA AACAGGAGAT TGCCAATTTA G AGGAAAT CT GCGGTGCAAG 54 60 

TTTGTCTGCG GGGCAGAAAT TGGACTTGGT TCTCTGGGCT CACATCAGTC AACAACGTCT 552 0 

TCGGGTCAAT GCTTGTCAGT TTCAAGTCAT AGAAGAGAAA ATGCGAGGGT ATTTTGACAA 5580 

TATCTTTTAT CTTCGTTTGC TGAGAAAGGT TCCGTCCTTT TTTGCTGGGC AACATATTCC 5 64 0 

ACTAGGAGTT GAGGATGGTG AGATGATGAT ATTCTTCTCT TTTCTCCTAT CTCATCGCAT 57 00 

TCTTCCTCTT CATACTATGG AGTAT ATT CT TGGTTTTGGA GGGCAGTTGG CAGATTTACT 57 60 

GACGCAATTG ATT C AAGAAA TGAAGAAGGA GGAACTATTG GGGGATTATA CAGAGGACCA 582 0 

TGTCACCTAT GAACTCAGTC AGCTTTGTGC TCAAGTCTAT CTCTATAAGG GCTATATTTT 58 80 

ACAGGATCGC TACAAGTACC AGTTAGAGAA TCGTCATCCA T ATT T AC TG A TGGAACATGA 5940 

TTTTAAAGAG ACAGCAGAGG AGATTTTTCA TGCTCTACCT GCTTTTCAAC AGGGGACAGA 6000 

TTTAGATAAG AAGATTCTCT GGGAATGGCT CCAGTTAATC GAATATATGG CTGAAAACGG 6060 
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TGGCCAGCAT ATGCGGATTG GTCTGGATTT GACATCTGGT TTTCTTGTCT TTTCAAGGAT 612 0 

GGCAGCCATT TTGAAACGGT ATTTGGAATA CAATCGTTTT ATTACCATTG AAGCTTATGA 6180 

CCCTAGTCGG CAT TAT G ATT TGCTGGTTAC CAATAACCCG ATTCATAAGA AGGAACAGAC 62 40 

ACCAGTCTAT TATTTAAAAA ATGACTTGGA TATGGAGGAT TTGGTAGCGA TTCGCCAGTT 6300 

ATTATTCACT TAAAAGGCTT GGTTAATCCA GGTCTTTTTT GTGAAATTCA CACAATCTCC 6 3 60 

T C AC ATTTTT TTAAAAATTA AAAAAAGTTG ATAAACAAGA AAGCGCTTTA TTTTGTATAC 6420 

TAGTAAGTGT AAAGAGGAAA CACCTCAAGA TCTTTATCAG GAGGACAGTA CATGTCACAA 64 80 

GAAAAATACA TCATGGCCAT TGACCAGGGA ACTACAAGTT CTCGTGCCAT CATTTTCAAC 654 0 

AAAAAAGGGG AAAAGGTTAG CTCGAGTCAA AAAGAGTTTA C CC AG AT T T T CCCTCAGGCA 6600 

GGTTGGGTTG AG C AC AATG C CAATGAAATT TGGAACTCTG TTCAGTCAGT TATTGCGGGT 6660 

GCTTTCATCG AAAGTGGTGT CAAGCCAAAT CAAATCGAGG CAATCGGGAT TACCAACCAA 6720 

CGTGAAACAA CGGTTGTCTG GGATAAGAAA AC AGG AC T T C CTATCTACAA TGCTATCGTT 678 0 

TGGCAGTCAC GCCAGACAGC ACCTTTGGCT GAGCAACTAA AAAGCCAAGG TTATGTGGAA 6840 

AAATTCCATG AAAAGACTGG TTTGATTATT GATGCTTACT TCTCTGCTAC CAAGGTTCGT 6900 

TGGATTTTGG AT C ATGT AG A AGGTGCTCAA GAGCGAGCAG AAAAAGGGGA ATTGCTCTTT 69 60 

GGTACTATCG ATACTTGGTT GGTTTGGAAA TTGACTGACG GTGCGGCTCA CGTGACTGAC 7 02 0 

TACTCAAATG CAGCTCGTAC CATGCTTTAT AACATTAAAG AACTCAAATG GGATGATGAG 7 080 

ATTTTGGAAA TCCTTAACAT TCCGAAGgCT ATACTTCCAG AAGTTCGTTC TAACTCCGAA 7140 

AT C T ACGGC A AGACAGCTCC ATTCCATTTC TACGGTGGAG AGGTGCCAAT CTCAGGTATG 72 00 

GCTGGGGACC AACAAGCAGC CCTCTTTGGA CAGTTGGCTT TTGAGCCAGG TATGGTTAAG 72 60 

AATACTTATG GAACAGGCTC TTTCATCATC ATGAATACTG GGGAAGAGAT GCAGTTGTCT 7 32 0 

GAAAACAACC TCTTGACAAC CATTGGTTAC GGAATCAACG GTAAGGTTTA TTATGCCTTG 73 8 0 

GAAGGTTCTA TCTTCATCGC AGGAAGTGCT ATTCAGTGGC TTCGTGACGG TCTTCGCATG 7440 

GTTGAAAATT CACCAGAATC TGAAAAATAC GCTCGTGATT CTCACAACAA CGATGAAGTT 7500 

TATGTCGTTC CAGCCTTTAC AGGTCTAGGC GCTCCATACT GGAACCAAAA TGCTCGTGGT 75 60 

TCCGTCTTTG GTTTGACTCG TGGAACAAGC AAAGAAGACT TTATCAAGGC GACTTTGCAA 7 62 0 

TCTATTGCTT ATCAAGTGCG TG AT AT CATC GACACCATGC AAGTGGATAC TCAGACCGCC 7 68 0 

ATTCAAGTAC TGAAGGTGGA TGGTGGTGCA GCCATGAACA ACTTCCTCAT GCAGTTCCAG 7740 

GCGGATATTT TAGGCATTGA CATTGCACGT GCTAAAAACC TGGAAACAAC AGCTCTAGGA 7800 

GCGGCCTTCC TAGCAGGTTT GTCAGTAGGG TACTGGAAAG AC TTGG AC G A GTTGAAACTC 7860 
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TTGAACGAGA CAGGAGAACT CTTTGAGCCA TCTATGAACG AATCTCGCAA GGAACAACTC 7 92 0 

TACAAGGGCT GGAAGAAGGC TGTGAAAGCA ACTCAAGTCT TTGCGGAAGT AGACGACTAA 7 9 80 

TACTGGCAGA ATAAAGCGAT TTATTTAGAA AGTGTGTAAA TATGGAATTT TCAAAGAAAA 8040 

CACGTGAATT GTCAATTAAA AAAATGCAGG AACGTACCCT GGACCTCTTG ATTATCGGTG 8100 

GAGGAATCAC AGGAGCTGGT GTAGCCTTGC AGGCGGCAGC TAGCGGTCTT GAGACTGGTT 8160 

TGATTGAAAT GCAAGACTTT GCAGAAGGAA CATCTAGTCG TTCAACAAAA TTGGTTCACG 82 2 0 

GAGGACTTCG TTACCTCAAA CAATTTGACG TAGAAGTGGT CTCAGATACG GTTTCTGAAC 82 80 

GTGCAGTGGT TCAACAAATC GCTCCACACA TTCCAAAATC AGATCCAATG CTCTTACCAG 8340 

TTTACGATGA AGATGGAGCA ACCTTTAGCC TCTTCCGTCT TAAAGTAGCC ATGGACTTGT 84 00 

ACGACCTCTT GGCAGGTGTT AGCAACACAC CAGCTGCGAA CAAGGTTTTG AGCAAGGATC 84 60 

AAGTCTTGGA ACGCCAGCCA AACTTGAAGA AGGAAGGCTT GGTAGGAGGT GGAGTGTATC 852 0 

TTGACTTCCG TAACAACGAT GCGCGTCTCG TGATTGAAAA CATCAAACGT GCCAACCAAG 8580 

ACGGTGCCCT CATTGCCAAC CACGTGAAGG CAGAAGGCTT CCTCTTTGAC GAAAGTGGCA 8 64 0 

AGATTACAGG TGTTGTAGCT CGTGATCTCT TGACAGACCA AGTGTTTGAA ATCAAGGCCC 87 00 

GTCTGGTTAT TAATACAACA GGTCCTTGGA GTGATAAAGT ACGTAATTTG TCTAATAAGG 87 60 

GAACGCAATT CTCACAAATG CGCCCAACTA AGGGAGTTCA CTTGGTAGTA GATTCAAGCA 8 82 0 

AAATCAAGGT TTCACAGCCA GTTTACTTCG ACACAGGTTT GGGTGACGGT CGTATGGTCT 8880 

TTGTTCTCCC ACGTGAAAAC AAGACTTACT TTGGTACAAC TGATACAGAC TACACAGGTG 8940 

ATTTGGAGCA TCCAAAAGTA ACTCAAGAAG ATGTAGATTA TCTACTTGGC ATTGTCAACA 9 000 

ACCGCTTCCC AGAATCCAAC ATCACCATTG ATGATATCGA AAGC AG CTGG GCAGGTCTTC 90 60 

GTCCATTGAT TGCAGGGAAC AGTGCCTCTG ACTATAATGG TGGAAATAAC GGTACCATCA 912 0 

GTGATGAAAG CTTTGACAAC TTGATTGCGA CTGTTGAATC TTATCTCTCC AAAGAAAAAA 9180 

CACGTGAAGA TGTTGAGTCT GCTGTCAGCA AGCTTGAAAG TAGCACATCT GAGAAACATT 92 40 

TGGATCCATC TGCAGTTTCT CGTGGGTCTA GCTTGGACCG TGATGACAAT GGTCTCTTGA 93 00 

CTCTTGCTGG TGGTAAAATC AC AG ACT AC C GTAAGATGGC TGAAGGAGCT ATGGAGCGCG 93 60 

TGGTTGACAT CCTCAAAGCA GAATTTGACC GTAGCTTTAA ATTGATCAAT TCTAAAACTT 9420 

ACCCTGTTTC AGGTGGAGAA TTGAACCCAG CAAATGTGGA TTCAGAAATC GAAGCCTTTG 94 80 

CGCAACTTGG AGTATCACGT GGTTTGGATA GCAAGGAAGC T C ACT AT CTG GCAAATCTTT 9540 

ACGGTTCAAA TGCACCGAAA GTCTTTGCAC TTGCTCACAG CTTGGAACAA GCGCCAGGAC 9600 



WO 98/18931 



PCT/US97/19588 



914 

TCAGCTTGGC AGATACTTTG TCCCTTCACT ATGCAATGCG CAATGAGTTG ACTCTTAGCC 9660 

CAGTTGACTT CCTTCTTCGT CGTACCAATC ACATGCTCTT TATGCGTGAT AGCTTGGATA 9720 

GTATCGTTGA GCCAATTTTG GATGAAATGG GACGATTCTA TGACTGGACA GAAGAAGAAA 97 80 

AAGCAACTTA CCGTGCTGAT GTCGAAGCAG CTCTCGCTAA CAACGATTTA GCAGAATTAA 984 0 

AAAATTAAGA AAAAATAAAA GAGGTGGAGG GCAGCATTCC TTGTCGCCCG TCCCTTCTTT 9 9 00 

TTAATGGAGA CAGAAAGATG ATGAATGAAT TATTTGGAGA ATTTCTAGGG ACTTTAATCC 9 9 60 

TGATTCTTCT AGGAAATGGT GTTGTTGCAG GTGTGGTTCT TCCTAAAACC AAGAGCAATA 10020 

GCTCAGGTTG GATTGTGATT ACTATGGGTT GGGGGATTGC AGTTGCGGTT GCAGTCTTTG 10080 

TATCTGGCAA GCTCAGTCCA GCTTATTTAA ACCCAGCTGT GACCATCGGT GTGGCCTTAA 1014 0 

AAGGTGGTTT GCCTTGGGCT TCCGTTTTGC CTTATATCTT AGCCCAGTTC GCAGGGGCCA 102 00 

TGCTGGGTCA GATTTTGGTT TGGTTGCAAT TCAAACCTCA CTATGAGGCA GAAGAAAATG 102 60 

CAGGCAATAT CCTGGCAACC TTCAGTACTG G AC C AG C C AT CAAGGATACT GTATCAAACT 1032 0 

TGATTAGCGA AATCCTTGGA ACTTTTGTTT TGGTGTTGAC AATCTTTGCT TTGGGTCTTT 103 80 

ACGACTTTCA GGCAGGTATC GGAACCTTTG CAGTGGGAAC TTTGATTGTC GGTATCGGTC 1044 0 

TATCACTAGG TGGGACAACA GGTTATGCCT TGAACCCAGC TCGTGACCTT GGACCTCGTA 1050 0 

TCATGCACAG CATCTTGCCA ATTCCAAACA AGGGAGACGG AGACTGGTCT TACGCTTGGA 10 5 60 

TTCCTGTTGT AGGCCCTGTT ATCGGAGCAG CCTTGGCAGT GCTTGTATTC TCACTTTTCT 1062 0 

AGTTTATACT CTTCGAAAAT C AAAT T C AAA CCACGTCAGC GTCGCCTTAC CGTACTCAAG 10680 

TACAGCTTGC GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAGAAAACAA 10740 

TTATGTTGAT AGAGCTTGGG CAAGAGCCCA ATTTCAGCAA AAAATGAAGT AAATCTTCTC 10800 

ATAATAAAAC GCATCATATC AAGCACGAAA ATTCCACGAG GTCAACTACA GTCAGAAAGC 10860 

TGAACAACAA GCCAAAACGC CCAAAAAAGG CGGCAAAAAG CAAGCACCTG CAAGCAACGT 10 92 0 

GCCGAAATGG TCAAATCCTG ATTATGTCAA CGAATTAGAC CCAAAAATCG TTGATATGCT 10980 

AGTAGAATTT CACAAGTCAC AAGGCACTTT GGAAACTCCC GAGGCGCAAG CAGAAATCGC 11040 

CCAAAAACGT GAAGAAATCG AG C AAAGG AG AGCTGAGCTT GAGGGTAAAA AACAAGAGCT 11100 

TTTGAACCGC TTGAACAAAT AGAGTTTCGC AAGTATTATG CTTACAAATT ACTTGAGCAA 11160 

TTAACTAAAA TATAAACCCT GCCTTTATAT CTAGGCAGGG TTTATATTTT AGAAATTCAC 1122 0 

GTAGGTTGTT ACGGTTTTTA CATACCCAGT ATAGTTTGAG TTTCTATAGT ATTCAGTGAT 11280 

AAACTTCCAT TTTCTTTGAG CAACATGGAT ATAAGTACTT GTTATGTAGT ATGGATATGG 1134 0 

GCTTTGTGAA TCCAAGTAAG ACTGATAAGC TTGT AT AC C A AAATATGCTC CACCAATTAT 11400 
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TGCACCCCAT GGACCCCCCA ATAAAGCACC TATCCTACCA ATCATATAAC TGATTCCAGC 114 60 

ACCAGTCATG AAGTTAGCGA ATGTGTTAGC TTGTTTATTC CCATGTATTG TGTTGACGTA 1152 0 

ATTCCAAACA TTAGGATCGT ATG AT C T AAA AG AT AT ATT T AGGTCGATTT CATTCTTTTG 11580 

ATAAGCCATA TAAAATGCCC CATTGATATA GACGCCGTCA GCACGTCGTT CAATAGTGTC 11640 

TACACTTCCA TCTGGATTGA CAACCTCAAG AACTTCATCG CTTAAAATAT TTACTTGCGT 117 00 

ATCTCCGAAC CGCACTGATG AGCCATTCTC AAACTGAGCC T C AC C AG AT A CAACTTTAGA 117 60 

GTTTGCCGAT AAGCTATCAT CAGCAAAAAC AAACAAGCGA CGGGGAAATG C TAG AC AT AC 11820 

AGAAAACAGA CATAACTAGC AAACACATGC ATTTAAACAT C T TAG AC AT A ACGGAAACTC 11880 

CTTTGTATTT TTGATTTTTT TCAACTTTTA T T AT AC AAT A AAACCAAATA AAAAGAAAGC 11940 

GGTAACAATA TGCTTAATGC GAAAATTTTT TATATATTTT TATGTTTGAT CGTTATCGAA 12 000 

ACTACAGGCT TGTTGTTGTT GAAAAGAGGT CTCGAAATGG GTTATTTAGA CACAGAAGCT 12 0 60 

ATTATCCTCG CAGTTTTTTC ATTTGCTTTT TACAACCTAT GTTCATTCGC TTGGGTCTGC 1212 0 

TCTACAATAA AAAACAATAA AAAATAAATA GACGTATTTT CAAAAAAAAC ma AATG C AT A 1218 0 

TTTATATTAG CAAAACGACG AT T TAAATC G TCGTTTTTTT GTAGTACGAC GGGCATGTCG 122 40 

TATATCTGAG GTGTAAGTCC TCAGCCTGAC TATCGTGAGG TAGCAGGGAG AGGAAGGGAT 12 300 

AGCGAAATCG TGGCTCTACG AACAGGAACG TGATAGTAAG GCGTATATAG CGGATAAGGA 123 60 

GGCTTCAAAC TCTAAAGTCC AAAAAGGTAG TCGTAACCTA TATGTGTAAA TCACGAGAGT 12 42 0 

AATTGAATTC GGACTAAGGT TTGTGTGAAA AAGATAAATC TTTCTAGAGT CTAAAGACTC 12480 

TGCGTCAGAT TTCCTATTTT CACTGTAACC TTTTAACGTC CTCATATCTT GTATAAACGA 12 54 0 

GGAAAGATGT ACGACTTATC CCGTGAGGTT TCATGAGCGT GAAAGCGTAG TAACAACGAA 12 600 

TCATGAGAAG TCAGCCGAGC CCATAGTAGT GAGGAAACTT CCGTAATGGA AGTGGAGCGA 12 660 

AGGGG 12 665 
(2) INFORMATION FOR SEQ ID NO : 13 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION; SEQ ID NO : 13 5: 

CGCTAATCAC TACAATCATT TTATTGTACT TTTTCACTCT CAAGAAAAGC AAGAAGTATT 60 
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CATTTTAGTT TCATTTAGTA TTATTTTGCA TACCTAAAAT ACAGTAAAAA ATCAGTCATC 120 
TTGGTATGCT CCTGCTTTCA CTATTCAACA CGTTTTTGAC TTATACTAGG CTCATTTCCA 180 
AAAGCATTAT ATAATAGTGA TATGAAACCA ACTAAACTAA ACAAGAAATA TAAGCAATAA 2 40 

AAATTCGTTT AAAAGATCTT ACTAAAGCTA ATACTAAATA AAAATAAAAG AGTAAACTAG 3 00 

GAAGTTTATT TCAAACAACC TAAAATACTG ATTTTCGGCT GAAGATAATA CTGGAGTGCA 3 60 

AATTAATGGG GTTATAATAA ATAGCTGATA GCTTGTGTTG GTTTTGGATT TTTTAAGAGT 42 0 

AGATGAGTAT T AAAAC TATA AGGAGGACGA AGGTGGCTAA AAATTTAAAA TTAAAATTAG 4 80 

CTCGGGTAGA GCGTGATTTA AC AC AAGGT C AACTGGCAGA GGCTGTCGGG GTGACACGCC 54 0 

AGACTATTGG TTTAATAGAG GCGGGAAAAT AC AATC CC AG TCTCTCGCTC TGCCAGTCTA 6 00 

TTTGCAGATG TTTAGGGAAA ACCCTAGACC AACTATTTTG GGAGGAAGAA GATGAAAAAT 6 60 

AGATTTTATT ATTCTCAATT ACTAGACGAA AGAGAAGAAC AACTGTTCAA TAAAGCGGGC 72 0 

TCTGAAAGTT TCTATATCTG CATTGCTTTG TCGCTCCTAT C T TAT AT CAT TTCAGTATTA 780 
GCACCAAGCC TTTTTAATTC TAATATGCTG CTAATCGTTA TCATCATAGG GACATTTTAC 840 
TTTTTCAATC GTGCCCGTTA TCTGGGAGTG ACCTACTATG GTCGTTTTCA TTTTACGATT 900 
TTGGGTTGTT TTTTCCTAAC CTTGGCTATT ACGGCTCTTT TGATGTTGCA GAATTATCAA 9 60 

TTCAACATAG AAATTTATCA GCACAATCCT TTGAATTTTA AATACCTGTC TGCTTGGGTC 102 0 

ATTACTTATA TCATTTACCT TCCGTGGATC TTTATTGGCA ATCTTGGTCT TAAGAGCTAT 108 0 

GGCGAATGGG CTCAGAAAAA ATTTGAACAA GATATGGATG AATTGGAGAG TGGAGAATAG 1140 

CTTGTTACTC TTTTCTCAAT CCAGCTAAAA TGTGATATAA TAGTACTAAT TTATTGGAAT 12 00 

ACATGAAAGT TCTTGAAAAT TTTCATGGGT TTCTAGCTAA GGAAGTAGGA AAAGTATGTA 12 60 

TCCAGATGAT AGTTTGACAT TGCACACGGA CTTGTACCAG ATCAACATGA TGCAGGTTTA 1320 

CTTTGACCAA GGGATTCACA ATAAGAAGGC GGTCTTTGAG GTGTATTTCC GCCAACAGCC 1380 

TTTTAAGAAC GGCTATGCGG TTTTTGCAGG TTTAGAAAGA ATTGTGAACT ATCTTGAAGA 1440 

CTTGCGTTTT TCAGATAGTG ATATAGCCTA TTTGGAGTCG CTTGGTTATC ATGGGGCGTT 1500 

CTTGGATTAC CTTCGCAATT TCAAGTTGGA GTTGACCGTT CGTTCTGCCC AAGAAGGGGA 15 60 

TTTGGTTTTT GCTAATGAAC CGATTGTGCA GGTGGAAGGA CCTCTAGCCC AATGTCAGTT 1620 

GGTCGAAACG GCTCTTTTGA ACATCGTCAA CTACCAGACT TTGGTGGCGA CGAAGGCAGC 168 0 

TCGTATTCGT TCGGTTATCG AAGATGAACC CTTGATGGAG TTTGGGACAC GTCGGGCTCA 1740 

AGAAATGGAT GCGGCCATCT GGGGAACACG CGCAGCTGTG ATTGGTGGCG CCAATGGAAC 1800 

CAGCAACGTG CGTGCGGGTA AGCTCTTTGA CATTCCTGTT TTGGGAACCC ATGCCCATGC 1860 
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CTTGGTACAG GTTTATGGCA ATGACTATGA AGCTTTCAAG GCTTACGCTG CGACCCACAA 192 0 

AAATTGTGTC TTTCTTGTGG ATACCTATGA CACCCTTCGC ATCGGTGTAC CAGCTGCCAT 1980 

TCAGGTGGCG CGTGAGCTGG GTGATCAGAT TAACTTTATG GGTGTGCGGA TTGACTCTGG 2040 

GGATATTGCC TACATTTCTA AGAAAGTCCG TCAGCAACTG GATGAGGCTG GATTTACAGA 2100 

GGCTAAGATT TATGCTTCTA ATGATCTAGA TGAAAATACC ATCCTTAACC TCAAGATGCA 2160 

AAAGGCCAAG ATTGATGTCT GGGGTGTGGG TACCAAGCTG AT T AC AG C CT ATGACCAGCC 2 2 20 

GGCTCTTGGG GCGGTTTACA AGATTGTTGC AATCGAAGAT GAAACTGGTC AGATGCGCAA 22 80 

TACGATTAAG CTGTCTAATA ATGCTGAAAA AGTTTCTACG CCAGGTAAGA AGCAGGTGTG 234 0 

GCGCATTACC AGTCGTGAAA AAGGCAAGTC AGAAGGCGAC TATATCACTT ATGATGGTGT 2 400 

GGATATTAGC G AC AT G AC AG AAATCAAGAT GTTCCATCCG ACCTATACAT ACATCAAGAA 24 60 

GACGGTTCGT AATTTTGATG CCGTTCCTCT CTTGGTGGAT ATCTTCAAAG AAGGAATATT 2520 

AGTTTACAAC TTGCCTAGTT TG AC TG AC AT TCAGGATTAT GCCCGTAAGG AATTTGACAA 2 580 

GTTGTGGGAT GAGTATAAGC GTGTGCTCAA TCCGCAGCAC TATCCAGTGG ATTTGGCGCG 2 640 

TGATGTATGG CAAGATAAGA TGGACTTGAT TGATAAGATG CGCAAGGAAG CCCTTGGTGA 2700 

AGGAGAAGAA GAATGAGTTT GCAAGAAACG ATTATCCAAG AGCTGGGTGT CAAACCAGTG 27 60 

ATTGATGCCC AGGAAGAAAT CCGTCGTTCT ATTGATTTCT TAAAAAGATA TCTGAAAAAA 2 82 0 

CATCCCTTCC TAAAAACCTT TGTACTAGGG ATTTCTGGGG GACAAGACTC AACCTTGGCA 28 80 

GGACGTTTGG CGCAATTAGC TATGGAAGAA CTGCGAGCTG AAACGGGAGA CGATAGCTAC 294 0 

AAATTTATCG CTGTCCGCCT GCCATACGGA GTGCAAGCTG ATGAAGCAGA TGCTCAAAAA 3 000 

GCCCTAGCCT TCATCCAGCC AGATGTCAGC TTGGTTGTGA AT AT C AAGG A ATCAGCTGAT 3 0 60 

GCCATGACAG CTGCAGTTGA AGCGACAGGT AGTCCTGTTT CAGACTTCAA CAAGGGGAAT 312 0 

ATCAAGGCAC GTTGCCGTAT GATTGCTCAG TATGCCCTTG CTGGTTCCCA TAGCGGAGCG 3180 

GTCATTGGAA CAGACCACGC CGCGGAAAAT ATCACAGGTT TCTTTACCAA GTTTGGTGAC 324 0 

GGCGGTGCGG ATATTCTCCC TCTTTACCGC CTCAATAAAC GC C AAGG AAA ACAGCTCTTG 3 3 00 

CAGAAACTTG GCGCAGAGCC AGCCCTTTAT GAAAAAATCC CAACGGCAGA CCTAGAAGAA 33 60 

GAT AAAC C AG GCCTAGCTGA CGAAGTCGCA CTTGGAGTCA CCTACGCAGA GATTGACGAC 3 42 0 

TACCTAGAAG GCAAAACAAT CAGCCCAGAA GCTCAAGCGA CCATTGAAAA CTGGTGGCAC 3480 

AAAGGC C AAC ACAAACGCCA CTTACCCATC ACCGTATTTG ATGACTTTTG GGAGTAAAAA 3540 

GGTCCGGGGG ACCTTTTTAG CTTCTTGCCC TG AAAT T AAA AAGCAAGAAA AACCTCCACT 3 600 
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GGAGGTTTTC AGCCTCTCAT CTTGAAATAA GAAAGTGAGA GAAGGTCTGG GGGATCTTGA 3 6 60 

ACCCCGAGTT TAGAAATAAG AAAATGAGGC AGATTCAGTA ACTCGAAGAG TTCGATTTCA 3 72 0 

TCGTCTTACC CCTGCAACGA TGACTAGGTT TGAAAAAGCT TGCTAGAGCG CATTTCAAAC 37 80 

CAGGCAGCAA CTGCGTCAAG AAATTAGAAG ACAAACTCGT TTTCTAGCTG TTACTGAGTT 3 840 

GAGCCTTTTT ACTACGAGTA TAGAAATAAG GAAGTGAGGT AGCATCATGA AATCTATCGG 3 900 

TACGCAAATA TTACAGACAG AACGTTTGAT TTTAAGAAGA TTTGTGGAGA GTGATGCAGA 39 60 

AGCCATGTTT CAAAATTGGG CTTCATCCGC TGAGAATCTG ACCTATGTTA CCTGGGATCC 402 0 

C C AT CCTG AT GTCGAAATCA CTCGAAACTC GATTTGCAAT TGGGTTGCTT CCTATACTAA 408 0 

TCTCAACTAT TATAAATGGG CCATTTGTCT AAAAGAAAAC CCAGAGCAAG TAATAGGAGA 4140 

TATCAGCATT GTTAAGATAG ACGAGGCTGA TTTAAGCTGT GAAATTGGCT ATGTGTTAGG 42 00 

CAAGGCTTAC TGGGGAAATG GTATGATGAC AGAGACTTTG AAAGCTATCT TGGACTTTTG 42 60 

TTTTACTCAA GCAGGTTTTC AAAAGGTCAG AGCACGTTAT GCCAGTCTCA ACCCAGCTTC 43 2 0 

AGGTCGTGTC ATGGAAAAGG CTGGAATGTC CTATCTACAA ACCATTGTTA ATGGTGTAGA 43 8 0 

G AG AAAAGG C TATCTTGCGG ATCTTATTTA TTATGGTATA AGTAGGGAAG AATGTTGAAT 444 0 

TCTATTTTCT GTTTCTATCG AAGTCAACTA TTTATTGTAA ATATAATAAT TAGCATTCCA 4500 

AGTTTATTTG AAACTTTAAA ATAGCATATT GATTAGTACA AG AC AG ATG T TCTAGTTCCT 45 60 

TCTTTAATCT GGTTTAGTGT TAGTTAAAAA ATCGCTTTAA GCTTGTAACT AAGAGGGAGC 4 62 0 

T AAT CG ACT A GATTCTCCAG CCGAACAGG1 GGTAATGTAC TTTTTATAGT GTAATCCTAG 4680 

CTGTTGTTAA ATTTAAAATA GAATCCTCTA TCGAGTTAGG GAATTAAATT CAACCAATTT 474 0 

TATTCATGTT TTTTCTATCA AATTATCTAA TATTAAAATA GTCTCATTCT GATGAGAAAA 48 00 

CTATTCCCAA AT CAT T C AT A CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC 48 6 0 

ATGAGCCACT TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC C AG T AT CGTT 4 92 0 

TTTCCTCGCT AGATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT 4980 

TCATCTCGAC TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT GGTTCATAAG 5040 

GAACAGGAAG ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG 5100 

AATAGGCATA GAG AC TAG AC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC 5160 

CCACCACGTG AAGAAAAAGA TGGCGGAAGC GTTTGATTGT TAAAGTTTGG AAGTCACCTC 522 0 

CAGCTAGATG TTTGAGAAAA AGATAGAGAT TGTAGGCGAT AC AG C TC AT C ATCATACGAA 52 80 

CTTCGTTTTT GATTAAGGTT GAACT 5305 
(2) INFORMATION FOR SEQ ID NO : 136: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 6: 

TGGCAGCTCG TCGTCGTAAA GGACGCAAAG TTTTGGCTGC AT AAT C C AAA CGAATTCTAT 60 

CAAAAATCAG TAGGAACTCG AGT C T ACT G A TTTTTATTTT TGTAAAAAAG TTCAGTAGAT 12 0 

GCAAATGGAT TCGGAAGCGA TGTTACAGTA GATTGAAACT AGAATAGTAC ACCTCTGTTT 180 

CTAAAACATT GTTAGAAATC GATTTGACTG TCCTGATCGA TTTGTCCTGT TATTATTTTA 24 0 

TTTTACTATA AAGTTGAAGT AGGTGGAGAT GGTACAGCAA CAATCGTCTT TAAAGATGGT 3 00 

TCAGCTATTA CAATTCCAGG AAATCAATTG GTAGCACAAG ATCCAAAAGC ACAAGATAGC 3 60 

ACTAAACTGA CTGCTGAAAA ATCAACTGTT AAAGCACCTG CTCAAAGAGT AGATGTAAAA 42 0 

G AT AT AAC T C ATTTAACAGA TGAAGAAAAA GTTAAGGTTG CTATTTTACA AGCAAATGGT 480 

TC AG C ATT AG ACGGAGCGAC AATCAATGTA GCTGGAGATG GTACAGCAAC AATCACATTC 54 0 

CCAGATGGTT CAGTAGTGAC GATTCTAGGA AAAGATACAG TTCAACAATC TGCGAAAGGT 600 

GAATCTGTAA CTCAAGAAGC TACACCAGAG TATAAGCTAG AAAAT AC AC C AGGTGGAGAT 660 

AAGGGAGGCA ATACTGGAAG C T C AG AT GC T AATGCGAATG AAGGCGGTGG TAGCCAGGCG 72 0 

■ 

GGTGGATCAG CTCACACAGG TTCACAAAAC TCAGCTCAAT CACAAGCTTC TAAGCAATTA 780 

GCTACTGAAA AAGAATCAGC TAAAAATGCC AT TG AAAAAG CAGCCAAGGA CAAGCAGGAT 84 0 

GAAATCAAAG GCGCACCGCT TTCTGATAAA GAAAAAGCAG AACTTTTAGC AAGAGTGGAA 900 

GCAGAAAAAC AAGCAGCTCT CAAAGAGATT GAAAATGCGA AAACTATGGA AG AT GTG AAG 960 

GAAGCAGAAA CGATTGGAGT G C AAG C CAT T GCCATGGTTA CAGTTCCTAA GAGACCAGTG 102 0 

GCTCCTAATG CTGCTCCTAA GACAACAAGT GCACCGCAAG CAACTGCAGG AACAATGCAA 108 0 

GATGTTACCT ACCAGTCACC TGCTGGCAAA CAATTACCTA ACACAGGTTC AGCATCAAGT 1140 

GCAGCACTTG CTAGTCTTGG TCTAGTGGTG GCAACAAGTG GTTTTGCTTT GCTAGGAAGA 1200 

AAGACTAGAC GTAGAAAATA G AAC AG CT AG AAAATT CT AT TCTCTACTTA AAGTTAGATT 12 60 

ATAAGGGGGA TTTTGAGAAG TCATCAATCC TAGTGATGGG TGAGAAAAGT GAGAACCCAA 132 0 

GATAATCACA TACTTTAGCT GAATAGGAAT ATTCTATCAA TGTAGCCAAT CTCTTCTGTC 1380 

TCTAACTGTG GAATAGGAGA TGGGCAATAT CGGATAGAAA AGATAGCAGA ATAGCTCTCT 144 0 
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ATTGAAGAGA GGAGGGGAAA CCGAAAAATT AGGTGCCCCT CCTCTTTTTT GGTATAATAG 1500 

AAGATAGAAA ACGAGGTTAG AAG AG AT GAT TTT TG AT AC A CATACACACT TGAATGTAGA 1560 

AGAATTTGCA GGTCGTGAGG CAGAAGAAAT TGCCTTGGCT GCTGAGATGG GTGTGACACA 162 0 

GATGAATATT GTTGGTTTTG AT AAAC CG AC GATTGAGCAT GCCTTGGAGT TGGTAGATGA 1680 

GTATGAGCAG CTCTATGCGA CTATTGGTTG GCATCCTACA GAAGCTGGTA CTTATACAGA 174 0 

GGAAGT T GAG GCTTACTTGT TGGATAAGTT AAAACATTCC AAGGTTGTGG CTTTAGGTGA 1800 

AATTGGCTTA GATTACCATT GGATGACAGC GCCCAAAGAG GTGCAGGAGC AGGTTTTTCG 18 60 

CCGTCAGATT CAGCTATCTA AGG ACT TGG A TTTGCCTTTT GTTGTCCATA CCCGTGATGC 192 0 

GCTGGAAGAT ACCTATGAGA TTATCAAGAG TGAGGGCGTT GGTCCTCGTG GT GGT AT CAT 19 80 

GCATTCATTT TCAGGGACGC TTGAGTGGGC AG AG AAGTT T GTGGATCTTG GTATGACCAT 204 0 

TTCCTTCTCA GGAGT GGTG A CTTTTAAGAA GGCAACTGAC CT C C AAG AAG CAGCTAAAGA 2100 

GTTACCTTTG GACAAGATGT TGGTGGAAAC AGATGCGCCT TACTTAGCAC CTGTACCCAA 2160 

GCGTGGTCGT GAAAATAAAA CAGCCTATAC TCGCTATGTG GTCGACTTTA TCGCTGACTT 22 2 0 

GCGTGGTATG ACGACAGAAG AGCTGGCGGT AGCAACGACT GCAAATGCAG AACGAATTTT 22 80 

TGG AC T GG AC AGCAAGTAAT GAAAGAGAAA ATTTCTCAAG TTATCGTGGT TGAAGGGCGT 234 0 

GATGATACGG TCAATCTCAA ACGTTATTTC GATGTGGAGA CCTATGAGAC TCGAGGTTCT 24 00 

GCCATCAATG CTCAGGATAT AGAGCGGATT CAGCGCCTGC ACCAACGTCA TGGAGTCATT 24 60 

GTCTTTACAG ACCCAGATTT TAATGGGGAA CGGATTCGGC GCATGATCAT GAT G GT CAT T 2 520 

CCAACAGTTC AGCATGCCTT TCTCAAGCGA GATGAAGCTG TTCCCAAGTC C AAG AC C AAG 2 580 

GGGCGTTCTC TGGGAATTGA GCATGCCAGC TATGAAGACC TGAAAACGGC TCTAGCTCAA 2 640 

GTGACAGAAC AATTTGAACA TGAGAGTCAG TTTGACATTA GTCGTAGCGA TTTGATTCGC 2 7 00 

CTTGGTTTTC TAGCAGGGGC AG AC AG C C GT AAG CGT AG AG AATATCTCGG AGAGACTCTC 2760 

CGAATCGGCT ATTCCAACGG CAAGCAACTC CTCAAACGCC TAGAGTTGTT TGGGGTTACT 2 82 0 

TTGGCAGAAG TGGAAGAAGC TATGAAATCT TATGAGTAGG AAAG AT GTAG CCGTTACAAT 2880 

TTTTTAAGTT TCACAGTATT TTTCGAAGCA GGTAGAAGAG GAGGCGTCTG ATGTTAATTG 2 9 40 

GTCAAAAAAT T AAAGAG AT T CGGATAGAAA AAGGAATTAG TCGTCCAGAT TTTTGTGGAG 3000 

ATGAGCAAGA ACTGACAGTT CGTCAACTGT CGCGAATTGA AAGTGGAGCT TCGCAACCGA 3 0 60 

GTTTGCCCAA GTTAGACTAT ATTGCTCGCC GGCTAGGAGT TCCAGTTTAT AGCCTTATGC 312 0 

CGGATTTTTC AGCTCTTCCT TCTGCTTATT TAGAATTGAA ATACCAGATT TTACGTGAAC 3180 

CAATCTATGG TAAAGAAGAG GAGTACGATA AG AAGG AAG C GTGTTTGGAA GAGATTTATA 324 0 
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AAACATACTT TGATAATCTT CCTAAAGAAG AACAATTAGC ATGTGAAGTA TTGCAGGCGT 3 3 00 

GTTTGGATAC TTCTAGAACT AGAAGGCCTG AATATGCAGA GTTAATACTT GAGGAACATA 33 60 

TGCCTCAGAT TATAGAAAAA GAAGCTTATT CAATAAATGA TATGTTGTTG ATTCGTTTGT 342 0 

TTTTTTATCA AATGCTCATT AGAAAAGATC TTGCCAAATT TATAAATCAA ATCGAAAAGC 3480 

TAATGCTCTT TCTTTTGGAA CAGAAGAAGG TAACTCAAAT AG AG AAT T AC TTTATAATTA 3 540 

GAG AT ACT CT TATTTCAGGA ATGTGTTGTC TTGAAAAGGT AGGAGTAACT GATTGTTTTA 3 600 

ATGATTATCT ATCGTGTTTA CAAGAAATTA TGGATAAAAC TCAAGATTAT CAAAAGAAAC 3660 

CTCTTGTATT TATGTTTTTG TGGAAGCAAG CATTAAGAGA AGAAAGAGAT TTTAGTTTAG 3 72 0 

CT G AAT CAT T TTATCAGTCT TCTAAAACAT TTGCGCAGCT AATTGGAGAT GAATTTCTAG 37 80 

TAAAGAAATT GACAGAGGAA TGGCAAGAGG ATGTCAAAAA ATATTTATAA ACATAGTGAA 3 840 

TCAGTGACAA AGATGTCCTT GTCCTCGTAT CAAAACAGTT CTAAAGTTCG TCTTTAGGGA 3900 

TGTTTTTTTA GATATAAGCT AAAAATGACA CGAAATGGTT AGATTTTAAG GACATTGATG 3 960 

TCCG 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12666 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



3964 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

TGAGACCGTT ATTTGTATTA GGGAAATGGG T AT CT AT T T T TAATGCTGTG GGGATTTTGA 60 

TTGTTTCTAT TATTCAAACC AAAAGCTTGT CAGGTATTGG AGCAGGATTG TTTAATCTAT 12 0 

ATAACATTTC ATCTTATATA GGTGATTTAG TTAGTTTCAC TCGATTGATG GCATTAGGAT 180 

TATCTGGAGC AAGTATAGCA TCAGCTTTCA ATTTAATTGT TGGTTTGTTT C CGGG AAT AT 240 

TGGCTAAACT GACAATTGGA TTAGTATTAT TCATTCTTTT ACATGCGATC AATATTTTTC 3 00 

T ATCGT T ACT AT C AGG AT AT GTTCATGGAG CACGTCTGAT ATTTGTTGAA TTTTTTGGTA 3 60 

AGTTTTATGA GGGTGGAGGA AAACCATTTC AACCTTTGAA GGCTTCTGAG AAATATATTA 42 0 

AGGTTATTAC AAAGAATTAA TGGAGGATAT ATATAATGGA AC AT TT AGC A ACTTATTTTT 480 

CAACCTATGG AGGAGCTTTC TTCGCTGCAT TGGGAATTGT ATTGGCGGTT GGATTAAGCG 54 0 

GTATGGGGTC TGCTTATGGA GTTGGTAAGG CTGGGCAATC TGCCGCAGCT TTACTGAAAG 600 
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AACAGCCTGA AAAGTTTGCC TCAGCTTTGA TATTGCAATT ATTGCCCGGA ACACAAGGAT 6 60 

TATATGGTTT TGTTATTGGA ATTTTAATTT GGTTGCAATT AACTCCAGAA CTTCCTTTAG 72 0 

AAAAAGGCGT TGCTTATTTC TTTGTAGCTC TTCCAATTGC TATTGTAGGA TACTTTTCAG 780 
CTAAGCATCA AGGAAATGTA GCAGTAGCGG GAATGCAAAT CTTGGCTAAA AGACCAAAAG 840 
AATTCATGAA GGGAGCAATT TTAGCTGCCA TGGTAGAAAC CTATGCAATT CTTGCTTTTG 9 00 

TCGTATCATT CATTTTGACC CTTCGTGTAT AAGAAATAAA TTTGCAATTC AAAGGAGGTG 960 

TCTAAATGAG CAATTTAGAA AACTTACGAG AGTCTGTTAT TGAACAAGCT CATGAAAAAG 102 0 

GGCGTATGAA ATTATTGGAT TCCAAAAAGA AGATTGATGA TGAATTTGAA ATGCAAAAGT 1080 

CGCTCATTAT AAAGAAAAAA GAAGCTGAAC AT G AACG AAA GTTAAAAGAA TTGCAACAGA 1140 

AATATCAAAT AATTTTTCAA CAATTAAAAA ATAAGGAACG CCAATCAACG TTAGTATCAA 12 00 

AACAGAAAAT ATTAAAAGAA CTTTTTCAAT CTGCTTTACT AGAAATGGAA TCTTGGAGTG 12 60 

CAGATAAAGA AATGGAGTTC ATCTATCGAA TTCTGGAACG AT ATT C AC AA CAAGAGGTCA 13 2 0 

TAGTAACCTT TGGGGAACGG ACTTTAGCTA AATTCAATTT GGAACAATTA GAGAAATTGA 1380 

AATTCTCTTT TCCAAATTAT TTATTTAGTG AACAACCTAT CTCAAATGAA TCAGGCTTAC 144 0 

TTATTTCAAT AGGTAAAATT GATGATAACT ATTTGTATAA AACATTAATT GGATCGATTT 15 00 

CTAAGGAAGA AAGTTCAAGT ATCGCAAATC AAATTTTTAT CAATTAAGGA TGAAATTGGT 15 60 

TAATCCTTCT TAGAAATTTG GAGTATTCCA ATAAAATTAG AAAGGTATTT TATGGATACT 1620 

AATCTTTTTT CAAAAATAAA TACGACGATT TCGGTAAAAG AAAACGATTT TATTACAGAA 1680 

GAAAAATTTC AAAAAATTAT AC AATC C AAA GATACGGAGA CATTGGCATT TATCTTAGAA 1740 

TCAACTCCCT ATCATTTATC GATTGACATC TTAGAAGATC CTAGTCAGAC AGAGATTTCG 1800 

CTAATGACAA AATTAGTCAA TGATTATAGA TGGGCCTATG CTGAAAGTCC GTCTGATATA 1860 

ATTGTGACTT TATTTGCTTT ACGATATGTT TATCATAATA TCAAAGTTTT ATTAAAATCT 192 0 

AAGGCGGCAA TTAAGAAAGA TTTTTCTAAA TTATTAATTC CAATAGGGAT T T T T GAT AT A 1980 

GAAAGTTTAA AACATTTAGT TTCTTCCTTA CATTCAGATA CACTTCCTGA TTTTATGGTT 2 040 

CGTGAAGTAG AATCAATTTG GAATGAGTAT GAAACTTTTA AT AAT AT TCG TGTACTTGAT 2100 

GTCGGAGCTG ATCTAGCATA TTTTAAACAT CTGAAACTTT TATCTAATGA GTTAGATGAG 2160 

GTACTGTCTC AGGTTATTGT CGAAATGATT G ACT T TT AT A AT ATT ATT AC TGTAAAACGT 2 220 

GGTTTATCTC AAAATAAGAG TCATGGGGAT ATTTTACAAT TACTTTCAGA TGAAGGAAGT 22 80 

ATTTCTGCTA AAGAATTTAT AT AC AT T G T A GAAAATCAAG AAATATTTGT GTGGTTCAAT 234 0 

AAAATAAATC CAAGCTTAGA TTCAATCTTT TCAACTTATG AATTGAAGAT GCAGGACGCA 2 4 00 
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ACAATTTCAT CTTCTGAGTT AGAATTTTTA TGTGATTTAC TATTGTATAA AACTTTAGAT 24 60 

CAAGGAAGGT ACAATGTAGA GGGGCCGTTA GTTCTTGCTA GATATTTATT GGGATGTGAG 252 0 

TTTGAAGTAA AG AATC T C AG AATGATCATA TCAGCTCTTC AAAATACAAT TCCCTTTGAA 2 5 80 

TCAATAAAAG AAAGGATACG CCCACATTAT GGAAGCTAAT AAGTATAAAA TTGGCATAAT 2 640 

TGGTAGCCGT GAT ATT AT T T TACCATTTAG CATGATTGGG TTTGATATAT TTCCTGCCTA 2700 

CCAAGAACAA GAAGCTATAA AT AC AC T AAG AAAATT AG C T CAATCTGATT ATGGTGTCAT 27 60 

TT AT AT C ACT GAAGACATTG CTTCAATGAT ATTAGATACA ATTCGCCATT ATGATTCCCA 2 82 0 

AGTTGTGCCT GCTATTATTT T ATT AC CG AC TCATAAACAA GGTTTAAATT TAGGATTAAA 2880 

ACGTATAGAG GATAATGTAG AG AAAG C AG T AGGACACAAT ATTTTATAAT AAT GT AC AAA 294 0 

ATTGTCTGTA AT AT TAT T C T ATAATTTTTG GACTTAGTAA GGAGAATAAC TTTGACTCAA 3 000 

GGGAAGATTA T AAAAGT AT C GGGACCTCTA GTTATTGCAT CAGGTATGCA GGAGGCTAAT 3 0 60 

ATTCAAGATA TTTGCCGTGT AGGTAAGCTA GGGTTAATCG GTGAAATTAT TGAAATGAGA 312 0 

AGAGATCAGG CATCTATCCA AGTCTATGAA GAAACATCTG GTCTTGGTCC GGGAGAACCT 3180 

GTTGTTACAA CTGGAGAACC TCTCTCGGTT GAATTAGGGC CAGGATTGAT TTCTCAAATG 3 240 

TTTGATGGCA TACAACGCCC ATTAGATCGA TTTAAATTGG CTACTCATAA TGATTTTCTA 3 300 

GTTCGTGGGG TAGAAGTTCC AAGTTTGGAT AGAGATATTA AGTGGCATTT TGATTCCACT 3 3 60 

ATAGCAATTG GTCAAAAAGT GAGTACGGGT GAT ATT C T T G GAACTGTCAA GGAAACCGAG 3420 

GTAGTTAATC ATAAAATTAT GGTTCCTTAT GGAGTATCTG GAGAAGTCGT TTCTATTGCA 34 8 0 

TCTGGCGATT TTACAATTGA TGAAGTTGTA TATGAAATAA AAAAATTGGA CGGTAGTTTC 3540 

TATAAAGGAA CGCTTATGCA AAAATGGCCT GTCCGCAAGG CGCGTCCTGT TTCTAAACGT 3 600 

TTAATTCCAG AAGAACCATT AATCACAGGT CAACGAGTTA TTGATGCATT CTTTCCAGTA 3 6 60 

ACCAAAGGGG GAGCTGCAGC AGTTCCTGGA CCGTTTGGAG CAGGAAAGAC AGTTGTACAA 3 72 0 

CACCAAGTAG CTAAATTTGC CAATGTTGAT ATTGTTATTT ATGTCGGTTG TGGAGAACGT 3780 

GGAAATGAAA TGACGGATGT ACTGAATGAG TTTCCTGAGT T G AT TG AC C C TAATACCGGA 3 84 0 

CAATCAATTA TGCAACGGAC AGTTCTGATT GCTAATACTT CAAATATGCC TGTTGCTGCT 3 900 

CGTGAGGCTT CAATTTATAC AGG AAT T ACC ATGGCTGAGT ATTTTCGTGA T AT GGGCT AC 3960 

TCTGTCGCCA TTATGGCTGA TTCAACTTCA CGTTGGGCAG AAGCGCTACG TGAAATGTCA 4 02 0 

GGACGTCTAG AAGAAATGCC TGGTGATGAG GGTTATCCTG CTTATCTGGG AAGTCGTATC 4 080 

GCTGAATATT ATG AAAG AG C AGGACGTTCT CAGGTTCTAG GGCTTCCAGA ACGTGAAGGA 4140 
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ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA ACCAGTTACT 42 00 

CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG ATGCTCCGTT GGCACAGCGA 42 6 0 

CGTCATTTTC CTGCAATTAA CTGGCTTACA TCTTATTCAC TATATAAAGA CAGTGTGGGC 432 0 

ACTTATATAG ATGGTAAAGA GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC 43 80 

TACTTACAAC GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 44 40 

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA TTATTTGCAA 4500 

CAGAACGCTT TTGATTCGGT AG AT AC ATT C ACTTCGTTTG CAAAACAAGA AGCAATGCTA 4 560 

AGTAATATTC TCACTTTTGC TGATCAGGCA AATCATGCTT TAGAGTTGGG TTCTTACTTT 4 62 0 

ACAGAGATTA TGGAAGGTAC CGTGGCAGTT CGAGACCGTA TGGCGAGAAG TAAATATGTT 4 680 

TCAGAAGATA GAT T AGATG A AATCAAAATT ATATCAAATG AGATTACACA TCAAATTCAT 474 0 

TTGAT AT TAG AAACAGGAGG TCTATAAATG AGTGTTATAA AAGAATACAG AACTGCTAGT 4 8 00 

GAAGTTGTTG GGCCTCTTAT GATTGTTGAA CAAGTAAATA ATGTGTCTTA CAATGAGTTA 4 860 

GTTGAAATTC AACTTCATAA TGGAGAAATT CGTCGTGGAC AAGTTTTAGA GATCCACGAA 4 92 0 

GATAAAGCAA TGGTTCAGCT TTTTGAAGGA TCTAGTGGAA TAAATTTAGA AAAGTCTAAA 4 980 

ATTCGTTTTG CTGGTCATGC ATTAGAATTG GCTGTATCTG AGGATATGGT TGGTCGTATT 504 0 

TTTAATGGGA TGGGAAAACC AATTGATGGT GG AC C AG AT T TAATTCCAGA GAAATATTTA 5100 

GATATTGATG GTCAAGCTAT TAATCCTGTA T CT AG AG AT T AT CC AGATG A AT T T AT T C AG 516 0 

ACAGGGATCT CCTCTATTGA TCATTTGAAT ACTCTTGTAC GTGGTCAAAA ATTACCAGTA 5220 

TTTTCAGGTT CGGGCTTACC TCATAATGAA TTAGCTGCTC AGATAGCAAG ACAAGCGACT 52 80 

GTTTTAAATT CTGATGAAAA TTTTGCGGTT G T ATT TG C AG CAATGGGTAT TACTTTTGAA 534 0 

GAAGCTGAGT TTTTTATGGA AGAACTCAGA AAAACAGGAG CG AT C GAT C G TTCGGTTTTA 54 00 

TTTATGAACT TGGCAAATGA TCCTGCAATT GAGCGTATTG CAACTCCCCG CATTGCTTTA 54 60 

ACTGCGGCAG AGTATCTAGC TTTTGAAAAA GAT AT GC AC G TTCTAGTTAT CATGACGGAT 5 520 

ATG ACT AAC T ATTGTGAAGC GTTACGTGAA GTCTCGGCAG CTCGCCGTGA AGTTCCAGGG 55 80 

AGACGAGGCT AT CCGGG AT A T T TAT AT AC A AATTTATCAA CTCTATACGA AAGGGCTGGT 5640 

CGCTTAGTTG GTAAAAAAGG TTCGGTGACA CAGATTCCTA TTTTAACAAT GCCAGAAGAT 57 00 

GACATAACAC ATCCAATTCC TGATTTAACT GGATACATTA CTGAAGGGCA AATTATTTTG 57 60 

TCGCATGAGT TGTATAATCA AGGTTATCGT CCACCAATCA ATGTTTTACC TTCTCTCTCT 582 0 

CGATTAAAAG ATAAGGGATC TGGAGAAGGT AAAACTCGTG GAG AT C ATGC TCCAACTATG 5880 

AATCAACTGT TTGCAGCCTA TGCCCAAGGG AAAAAGGTTG AAGAGTTAGC AGTAGTATTA 5 940 
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GGAGAATCGG CTTTATCTGA T GT AG AT AAA TTGTATGTGA GGTTTACAAA GCGTTTTGAA 6000 

GAAGAGTACA TAAACCAAGG ATTTTATAAA AATCGAAATA TAGAAGATAC GTTGAATCTT 6060 

GGGTGGGAAT T ACT AT C AAT TCTTCCTAGA ACAGAGTTAA AACGTATCAA AGATGATTTG 612 0 

CTTGATAAAT ACTTACCTTT GGTAGAAGTT TAATCCGGAA ATGGAGTGAT TATCTATGGT 6180 

ACGTTTGAAT GTAAAACCAA CTCGTATGGA ATTGAATAAC TTAAAGGAAC GTTTGACAAC 6240 

AGCTGAACGT GGACATAAGT TATTAAAGGA TAAAAGAGAT G AAT T GAT G A GGCGATTTAT 63 00 

TTCTTTGATT CGTGAGAATA ATCAACTTCG GAAAGAAGTG G AAAGTT AT C TAATTGATAA 63 60 

TCTAAAATCC TTTGCAGTTG CTAAATCATT AAAGAATTCT CAAATGGTGG AGGAATTATT 642 0 

TTCAATTCCA T C G AAAG AAA T TG AAT TAT T TGTTGAGAAA GAAAATATCA TGAGTGTAAC 64 80 

AGTTCCTAGA ATGCATATGA AT AT T AC TT C T C AAAATG AG AACAGTGAAT ACAGCTATTT 6540 

ATCTTCTAAT AGTGAAATGG AT GAT GT ATT TGCTACAATG AATAGTTTAA TTTATAAATT 6 600 

ACTAAGACTG GCAGAAGTTG AAAAAACGTG TCAGTTAATG GC T GAT G AAA TAGAAAAAAC 6660 

ACGTAGACGT GTAAATGGTT TAG AAT ACT C GATTATTCCA AACTTGTCGG AAACTATTCA 6720 

TTATATAGAA T TG AAACT AG AGGAGGCAGA AAGAGCCAAT TTAGTTCGTA TTATGAAAGT 67 80 

GAAGTAGATC CTTTATTTAG ATTATTAATT AGATGAACAA AT AT C AG CTT GGATAAGGCT 6840 

TTAAGCCTTT CTAAGCTTTT TTT AT TG AC A GTATCAGGAT AT CTT T TT C A AAATTTTGGT 6900 

TTGTTAGATA ATGAAAATGT T T CT AC T AAT CTAGATTTAG GATTAGTAAA TCGTAAATGT 69 60 

AAT TAT AT AG AAAGTAAGCG CGTCATAACA AGGTATCTAT CATTCATGGA GCTCCTCCTG 7 02 0 

TATACTATTA GTAAAGTAAA ACTATTGGAG GATATTTTAA TGCCACAACC TATTGTTCCT 7 080 

GT AGAG AT T C C AC AAT CT C G TCGTTTTGAT TCTAAAAAGA GAAATGATAT TCTGCTTAAA 7140 

ATTCGTATTG GCAAGCTTGA AGTAAGTTTT TTTCAATCTC TCAATCTCGA AATGGTAGAA 7200 

CAGCTTTTGG ATAAGGTGTT GCTCTATGAC AATTCATCTA TCTAGCCTAG GGGAGGTCTA 72 60 

TCTCGTGTGT GGGAAAACTG ATATGAGACA AGGAATCGAT TCACTGGCTT ATCTGGTTAA 7 320 

AACCCACTTT GAATTGGATC CTTTCTCCGG TCAAGTCTTT CTCTTTTGTG GTGGACGTAA 7 3 80 

AGACCGCTTT AAAGTCCTTT ACTGGGATGG TCAAGGATTT T GGCT ACT AT ATAAACGCTT 7440 

TGAGAACGGC AGATTGATTT GGCTAAGTAC AGAAAAGGAT GTCAAAGCTC T C AC ACC AG A 7500 

ACAAGTAGAC TGGCTTATGA AGGGCTTTTC TATCACTCCA AAAATATAGT AGATTGAAAC 7560 

TAGAATAGTA CACCTCTGCT TCTAAAACAT TGTTAGAAAT CGATTTTACT GTCCTGATCG 7 62 0 

ATTTGTCCTG TTCTTATTTC ATTTTACTAT AAAT C CAT C A GAAAGTCGTG ATTTCTATTG 7 6 80 
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AAATGAGGAC TTTCTTTTTA TACT CAT CTG CTTTCAAAAA GCATTCTAGT CCATCTCCGA 774 0 

TTAACGATGG ACTTTATCAC CTCCTTCTCC AGTCCTTGTA TAACATCTTG GAGTTGATTC 7 8 00 

ATGACATCTT CCAAAGTTTA AAAGGCTTTA TTCTTAAATC CACGTTTACG AATCTCTTTC 7 8 60 

CACACTTGTT CAATGGGGTT CATCTCTGGT GTGTATGGAG GAATAAATGC AAAGCCAATA 7 92 0 

TTAGTCGGAA TCTTTAAGGT ACTTGATTTA TGCCATATAG CATTGTCCAT AACGAGTAAA 7 9 80 

AGATAATCAT CTGGATAAGC TTGTGAAATC TCCTATTCCT AAAGCCCCTT TAGCGCATAA 8040 

CTTTGGCTCA GCTTCTATTA TCGCTCACAC CAT CC AT C AG AAGTTTAATC TGAAGGTACC 8100 

CAATTATCGC CAAGAAGAAG ATTGGGCTAG GATGGGTTTA CCAATCACAC GTAAGGAAAT 8160 

CTCTAATTGG CATATCAAGG CGAGTCAATA CTATTTGGAG CCCCTTTATA ACCTCTTGCG 82 2 0 

AGAGAGACTA TTGACTCAGC CCTTACTTCA TGCGGATGAA ACTTCTTATA GGGTGCTAGA 82 80 

GAGTGATAGT CAGCTGACTT ACTATTGGAC TTTTTTGTCA GGTAAAGCAG AGAAACAAGG 834 0 

GATTACGCTT TACCACCATG ATCAGTGTCG AAGTGGTTCA GTAGTACAAG AATTCCTAGG 8400 

AG AT TAT T CT GGCTATGTGC ATTGTGATAT TTTGCGGCAG TAACTTAGGA CTTTAGTCCT 84 60 

CTAGTTCTGC CTATGCGATA GCAGTCCAAG GTTTAGGAGC AAGGCGACGC TAAGCTTGGT 8 52 0 

AAACTTCGAA CCGCTCGTCT GCTTATCGTC AACTGGAAGA AGCTGAACTT GTTGGATGTT 8580 

GGGCGCATGT GAGAAGGAAG TTTTTTGAAG CGCCCCCCCA AGCAAGCGGA TAAATCATCC 8 640 

TTAGGAGCTA AAGGT T TAG C TTATTGTGAT CAGTTATTTT CCTTGGAAAG AGACTGGGAG 8700 

GCTTTGCCAG CTGATGAACG AC T AC AGAAA CGTCAAGAAC ATCTCCAGCC CTTAATGGAA 87 60 

GACTTCTTTG CTTAGTGCCG GCGTCAGTCA GTTTTAGCAG GTTCAAAACT AGGAAGGGCA 8 82 0 

ATTGAATACA GCCTCAAGTA TGAAGAAACC TTTAAGACCA TTTTGAAAGA CGGACATCTG 88 80 

GTCCTTTCCA ATAATCTAGC TGAACGCGCC ATTAAATCAT TGGTTATGGG ACGGAGTAAA 894 0 

AGAGTCCAGT GGACTCTTTT AGCCTAAGCT CAGTTTAAAA AAGCGAGGGT GGTTATTTTC 9 000 

TCAAAGTTTT GAAGGAGCTA AAGCAAGAGC T ATT AT TAT G AGTTTGTTGG AAACAGCTAA 90 60 

ACGTCATCAA TTAAATAGCG AGAAATATCT ATCCTATCTT CTAGAATGTC TTCCAAACGA 912 0 

GGAAACTCTC GTAAACAAAG AGGTTTTAGA GGCTTATTTA CCATGGACTA AAGTTGTACA 9180 

AG AAAAGTG C AAATAAGAAA TCTCCAGATT AGGAACTATC CGTGAGTTCT CCAGTCTGGA 92 4 0 

GATTTTTCAA TAGACTTCCT G CGAAAC AAA ATATGGTATA ATAGTTCTAT GAATGATGAA 93 00 

GCAAGTAAAC AACTAACCGA TGCACGATTT AAGCGTCTTG TTGGTGTTCA ACGCACGACT 93 60 

TTTGAAGAGA TGTTAGCTGT ATTAAAAACA GCTTATCAAC TTAAACACGC AAAAGGTGGA 9420 

CGAAAACCTA AATTAAGTCT AGAAGACCTT CTTATGGCCA CTCTTCAATA TGTGCGAGAA 948 0 
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TV TV ^*T TV f Tl TV ^ ^ «T f'lT 

AAGATAAAGT 


C T AT AACC AT 


GCGCTATCTA 


AGGAGAGAAG 


9960 


LAAGGTTGAG 


Tv Tv ft tv m frr ftt m *»"n 

AACATCTTTG 


CCAAAGTAAA 


AACGTTTAAA 


ATGATTTCAA 


CAACCTATCG 


10020 


AAAT CAT C T A 


AACGCTTCGG 


ATTACGAATG 


AATTTGATTG 


CTGGTATTAT 


CAATCATGAA 


10080 


C T AGG AT T C T 


AGTTTTGCAG 


G AAGT C TAT T 


ATCAAAAATA 


CCATCAAGAT 


TATATAAGAT 


10140 


m ft "a m tv ft is #t s~t tv 

I GATACAGGA 


tv tv tv ^immmrTT -» m 

AAAGTTTTAT 


TTGATGGTGT 


AAATATTAAT 


CAAATAGATA 


AAAAAATATT 


10200 


7\ T\ m TV TV TV TV m 

AAGT C AAAAT 


TTAGGAGTAG 


TTCCACAGGA 


TTCATTTTTA 


TTGAACCGAA 


GTATTCTTGA 


10260 


rn tv tv rn 7\ rn tv tv /^*m 

iAAlAlAACT 


m m TV TV TV ft ft TV ft ft 

TTAAAGCACG 


TV TV /TfTlfTVTi ffl Ff 1 

AAGTTACTTC 


ACAAAAGATA 


GAGGAAGTTT 


GTAAAGCAGT 


10320 


rrkft tv 7\ tv mrifn tv m 

1 LAAAI C TAT 


ft Tv m /"i tv tv tv m ft tv 

G ATG AAAT C A 


TGGCTATGCC 


GATGAAATTT 


AATACTATCA 


TCTCAGAGAT 


10380 


/""I /T ft /T m TV 7\ t\ m 

GGGGTCAAAT 


t I 1 M 1 ri'l j^n TV **T ITT ^— >4 

ATTTCAGGTG 


GGCAAAGGCA 


ACGGATAGCA 


CTGGCACGTG 


CATTAATAAA 


10440 


m T\ 7vm/*irTn?i rrn 

1 A/vl LLTAGT 


ATTGTAATTT 


III TV TV rf^r ** "» * m % 

TAGATGAAGC 


AACTAGTGCA 


TTAGACACTA 


TTAATGAGGA 


10500 


T\ TV /"T TV TV m TV TV /"I TV 

AAGAATAACA 


tv tv ft m tv .m tv m tv ^ti 

AAGTATATAC 


AAAGTCAGGG 


CTGTACTCAA 


ATAATTGTAG 


CTCATAGATT 


10560 


/^•m/^ A A /~^/~* a mm 

G 1 C AACGATT 


TV TV ft ft TV m/^t/^#-i/-i 

AAGGATGCGG 


ATGTTATTTT 


TGTAATGAAA 


GGTGGTAAGA 


TTGTTGAGTC 


10620 


A f~^{~* A A AmpJlfTl 

AuuAAA 1 L. A 1 


TV tv *o m tv /^mm tv tv 

AACaTACTTAA 


TTtft ft Tv m /T m m 

TGGATCTTGG 


TGGAGAGTAC 


TACAGCTTAT 


ATACAAAAAG 


10680 


2\ A A rnr~« Tiprrr 
Vjj/V/VA 1 tjAvjLj 1 


ft m TV tv ta /"T. ta TV TV TV 

GTAAAGAAAA 


JTl f* TV TV i^r TV TV TV ft TV 

TGAAGAAAGA 


TV TV TV /TV S" — 1 Tl T*. ill -n i T 1 

AAATGAATAT 


GTAATTTTAA 


CAACAGCCTC 


10740 


ACTAGGGGTG 


ATGATTGGAA 


TAGTGTTTGC 


AATTTTTTTA 


GATTTTCCAG 


TTGAATATGG 


10800 


TATTTCTTTA 


GGCTTGTTGA 


ATGGAATAGT 


ATTGGGTTCG 


CTGATTGTTT 


ACAAAAACAA 


10860 


TAAGAATTAA 


GCATAATTTT 


TTGCTGTAAA 


CTAAGGAGTA 


GAGATGGCTA 


TAGTTGAAAT 


10920 


TATAAATCTA 


AC AAAAAG CT 


TTAAAGATAT 


TGAAGTTATT 


CATAACACTT 


AAATAATAGA 


10980 


GCAACTACAG 


TAGTAGCTTA 


AAAACATGAT 


T AAATCG CT A 


TTCTTAGGAG 


TAGCGGTTTT 


11040 


TCTTTTTGTT 


TAATACTCTT 


TGAAAATCTC 


TTCAAACCAC 


GTCAGCTTTG 


CTTTACCGTA 


11100 


C T C AAGT AC A 


GCCTGCGGCT 


CGCTTCCTAG 


TTTGCTCTTT 


GATTTTCATT 


GAGTATAAAA 


11160 


AGGGTCAAGT 


AAGTATAGTA 


AATTGAAATA 


AGATATGAAC 


AAATCGATTA 


GAAAAGTCAA 


11220 
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ATTAATTTCT AGAAATATGT TAGAAATTGG TTTGAATTCC GCAATCAATT TGTTCAGTTT 112 80 

TTATTTCATT TCATTTTATT TAATTAGATT TTCCAATTTT TTAATTCAAG CTAAAAATCC 1134 0 

CCAATCGTAG TGATTGAGGA TTGAGTAAAT AAATCTTAAA C AATAC CTTG TGCAATCATG 114 00 

GCATTTGCTA CATTTTCAAA GGCAGCAATG TTAGCTCCTG CAAGGTAGTC TTTATCAAGA 114 6 0 

CCGTATGTTT CTGAAGTCGT TTTAGCTGTG TTGAAGATGT TTGTCATGAT GTCTTTGAGA 1152 0 

CGGCCATCAA CTTCTTCACG AGTCCATGAG AGGCGAAGAC TGTTTTGGCT CATTTCAAGA 11580 

GCTGAAACGG CTACACCACC AGCGTTGGCA GCTTTTGCAG GTCCGTAGAA GATACCATTT 11640 

TCTTTGTAAA CTTTGATGGC ATCAAGGTCG CTCGGCATGT TGGCACCTTC AGATACACAG 11700 

ATAACGCCTT GAGCAACCAA ACGTTTAGCT GCTTCACCGT TGATTTCGTT TTGAGTGGCA 117 60 

CATGGAAGAG C AATGT CAT A GTTTCCAGCG TAAGTCCATA CAGTACCTTC GTGGTAGGTT 1182 0 

GCAGTTGCTT TTTCAGCTGC ATACTCAGTC AAACG AG C AC GACGTTTTTC TTTAACATCA 1188 0 

ACCAAAAGAT CGAAGTCGAT ACCATTTTCA TCGATGACAT AACCATTTGA GTCAGAAACA 11940 

GAAATAACAG TTGCACCGAG TTCAGTTGCT TTTTGAAGAG CAT AT TG AGC AACGTTACCA 12 000 

G AACCT G AAA TAACGACTTT CTTACCAGCA AAGCTGTTAC CGTTAGCTTT GAG C ATT T C T 12 060 

TCAGTATAGT AAACCAAACC GTAACCAGTT GCTTCTGGAC GAATCAAGCT ACCACCAAAT 12120 

CCAAGAGGTT TACCAGTCAA G AC ACC AG C A TCAAATTGGT TAAGACGTTT GTATTGACCG 12180 

TAAAGGTAAC CAATTTCACG TCCACCAACA CCGATATCAC CAGCAGGTAC GTCAAGTGAT 12240 

GGTCCGATGT GTTTTTGCAA TTCAGTCATG AAGCTTTGGC AGAAGCGCAT CACTTCAGCA 12 3 00 

TCTGTTTTAC CTTTAGGATC GAAGTCTGAT CCACCTTTAC CTCCACCGAT AGGAAGTCCA 12 3 60 

GTCAAGACAT TTTTAAAGAT TTGTTCAAAT CCGAGGAATT TCAAGATCCC TTGGTTTACA 12 42 0 

GTTGGGTGGA AACGAAGTCC ACCTTTGTAT GGTCCAACAG CTGAGTTGAA TTGAACACGG 124 80 

TAACCACGGT TTACTTGAAT TTTTCCATCA CGGTCAACCC AAGGAACACG GAAAGAAACC 12 540 

ACGCGCTCAG GCTCAGTAAT ACGTGCCAAG AT ATT T TC T T CGATATACTC AGGGTGTTTT 12 600 

TCAAATACAG GTTCTAAAGT GTTGAAAAAT TCTTCAACAG CTTGGAGGAA TTCAGCCTCG 12 660 

TGCCGG 12 66 6 
(2) INFORMATION FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 083 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 8: 

AGCAACTGTT GTGAACCAAT TCCGATAAAT TCCAAGAATT GGTTAATAGA GCCATTTTGA 60 

CCAAAAATCC CGATAAAAGC ATAGGCTTTA AGGAGCAAAT TGATCCAGGT AGGAAGGATA 12 0 

ATCAGCATGA GCCAGAGTTG ACGGTGTTTG AGACGGGTCA AAAAGAGGGC CGTCGGATAA 180 

CTGATAAGCA GT GC C AC AAA GGTCACAATG CCTGCATAAA GCACTGAGTT GAAACTCATT 24 0 

TTAAGATAGG TCAAGTTTTG TGACGCAAAG TAAGATTTGT AATTTTCTAA ACTGAACTGG 3 00 

CCTTCGATGT TGAAAAAGGA TTGACCGAAA ATCAAGACCA AGGGTGCCAA T AC AAAG AG C 3 60 

GCAATCCAAA GCATGTAGGG T AC T AC AAAG AGTTTAGAGC TTGTTTTCTT CATCTCTTTC 420 

CTCCTCGATT GCATTGATCA AACCTGCTTC TTGCTCTTCG ATTTCTACGT ACTCCTCAAT 480 

ACGAGCATCG AACTCTTCTT CGGTTTCATT GAGACGCATG ATGTGGATGT CTTCTGGTTC 54 0 

AAAGTCCAGA CCGATTTCCT CACCCACGAT AGCCTTACGG GTTGAGTGGA T CAT C C AT TC 600 

ATTTCCAAGT TCGTCATAGG CGATAATTTC ATAATGAACT CCACGGAAAA GCTGGGTATC 660 

GACCTTAACT TGGAGCTTGC CTTCTTCAGG AAGGGTAATG CGCAAGTCCT CTGGACGAAT 72 0 

AACGACCTCA ACAGGTTCAT TTGGCTTCAT CCCACCATCA ACCGCTTCAA AGCGTTTGCC 7 80 

GTTAAATTCG ACCAAGTAGT CCTCAATCAT GGTACCTGGC AAGATGTTTG ACTCCCCGAT 84 0 

AAAGGTGGCA ACAAAGTGGT TGATTGGCTC ATCGTAGATG TCCACAGGGG TTCCAGACTG 900 

GACAATCTCG CCATCATTCA TAACGAAAAT CCAGTCACTC ATGGCAAGAG CTTCTTCCTG 9 60 

ATCGTGAGTG ACAAAGACAA AGGTAATGCC CAATCGTTGT TGTAATTCAC GCAATTCGTA 1020 

CTGCATGTCT GTTCTCAATT TCAAGTCCAG CGCTGATAAA GGCTCGTCCA ACAAGACCAC 108 0 

ACGGGGTTGG TTGATGATAG CACGGGCGAT GGCCACACGC TGACGTTGTC CTCCAGAAAG 1140 

TTTGCGGATG GAACGTTTTT CATAACCTTC CAACTGAACC ATCTTGAGAA CTTCCGCTAC 12 00 

ACGCTGCTCG ATTTCTTTCT TATCAATTTT ACGCAAGCGA AGTGGAAAGG CAACATTTTC 12 60 

AAACACATTC ATATGTGGGA ACAAGGCATA GGATTGGAAG ACGGTATGTA CGTCGCGCTT 132 0 

GTTGGTTGGA ATATCATTGA TACGAACACC GTCTAGCATG ATATCTCCTG TCGTCGCATC 13 80 

CAGTAAACCT GCAATAATGT TTAGGATAGT TGATTTCCCC GAACCAGATG CACCTAGAAG 1440 

GGTGTAGAAT TTCCCTTCTT C C AACT C AAA GTTGATGTCT TTGAGAACCT TGGTGTTGCT 1500 

GTCTTCAAAA ACTTTAGAGA CGTTTTTGAA TTCGATAATT GGCTTTTTCA ATTGGCATAA 1560 

ATTCCTTCTT TTTCATAGAT TAACCGATCG GGGCTCTGTC AGGTCCCCAC TACCTCTTGC 1620 

AGGGAGTAAA ACCACCTGCA TACATCTTCG CTACCGATAG GCTTTCACCC AAGATCCGGA 1680 
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CTTCTCTTTC AAGCGTAATA CCTGAGTGTT CCTTGACTTT TTCGATAACC GATTGGATCA 17 40 

AGTCCTCGTA GTCTTTGGCC GTTCCATCTG CGACATTGAT CATAAATCCT GCATGCTTTT 180 0 

CTGACACTTC TACGCCACCG ATACGATAGC CTTTCAAGCC AGCTTCTGAA ATTAACTGAC I8 60 

CTGCAAAATG CCCGACTGGA CGCTTAAAGA CCGAGCCACA AGATGGGTAT TCCAAAGGTT 1920 

GCTTGAGTTC ACGTAGGTGC GTCAAGCGGT CCATTTCCTG CTTGATAACG TGATGGGTTC 1980 

CTGGAGCTAG GGCAAATTTA ACTGACAAGA CAACTGCACC AGACTCCTGA ATAGCTGAAT 2 04 0 

GACGGTAACC AAAAGCCAAG TCTTTAGCAG ACAGGGTTTC GATTTCTCCA TCCTTGGTCA 2100 

AGACCTTACA AGACTGCAAG ATGTGAGCAA TCTCGCCACC ATAGGCACCC GCATTCATAA 2160 

AGACAGCACC GCCAACGCTT CCTGGAATAC CACAAGCAAA CTCAAAGCCA GTTAAACTAT 222 0 

GACGGAGGGC AATGCGAGTT GTTTCAATCA AGTTAGCCCC AGCTTCTGCT TCAATGGTAT 2 2 80 

AGCCATCAAC AGAAACGTTA TTGAGCTTGT CACACAAGAT GACAAATCCA CGAATCCGAC 2 340 

CATCACGAAC GATGATATTG CTTGCATTGC CAAGAACCAT CCAAGGGATA TTTTCTTGGT 2 4 00 

TGGCAAATTT CACAACGCGA GCCAACTCAA AACGATTTCG TGGAAAGACC AAATAATCAG 2 4 60 

CCTCTCCACC TACTTTTGTA TAACTATAGC TATGCAAGGG TTCCTTAAAA CGGAT ATCAA 2 520 

TTCCTTCTAA GATTTCAAGC ATTTTTTCTC TTACAGACAT GTCACTCTTC CTTTTACAAA 2 580 

ATTCATTCCA TTATACCATT TTTAGAGACA TTTGACGACC ATAAAAATAG CTTGTTTGGA 2 64 0 

TTTTGCATAA G AAAAAG AG G TTCCCCCCTT TTTATGATTT TTTACAAAAG ATTTCCTTGG 2700 

TTCCATAGGC GACCAGAACG AGCTCCAGTG CTAGAATCAC TTCAACCAAG ACTGGATTTG 2 7 60 

TCAACCAGCC TACTTGGAAA AGAGATGGTG CCAGATCAAA GAAGGCATGC AAGCCATAGG 2820 

CTGCTAGGAG ATAAATCCAT TTCTTCTGGC GAACAGCTTG GTAAACCCAA ACTGTCAAAA 2 88 0 

GTAATTGGAA ACCAAGCGCC AAGATTCGCT CAAAACCAAG CAAATAAATC TGCCAGACCG 2 940 

AAAGTGACTG AATGGTTTTT AACATATTTT CAGACAGTAA TTGCATAACC TGTGGATTCT 3 COO 

GAGTTTGAAC TGCCGAAAGA ACAATGTAAA GATTGAGTAA ACTAGTAAGG CCTAGAAAAA 3060 

TCAACTCCAA GCCACCATGC CCC 3 083 
(2) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi} SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
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CCGGAGGATA 
TACTGCGACA 
TAAAGGTTTG 
TCTTATACTC 
ACTGTTTTGA 
GCTGACGTGG 
CCATATCATA 
CATTTTGTCG 
AAATCAACTT 
CACGATTGAT 
GAGACTCCTG 
TTTCAATAAT 
CACGGAGACG 
CTTCATGAAG 
AGGCAAGTTT 
AAGCACGATT 
TAAGAATCAA 
AAATGAGATT 
TTCTGGTGTC 
GCATAAGATA 
CTGTTAAAAA 
GTTCTAGATT 
AAAAGGTCAA 
TTCTATAGGC 
GGGCGATAAG 
CTGCTATCAT 
TCACTGTATG 
TCTTGAAGCG 
GAACTTCTTC 



TTGACCACCA CCAAAAGCAG 
CTGGTCAACT CACTATCTGA 
TCTATCAGCT CTTTCCACCA 
AATGAAAATC AAAG AG C AAA 
GGTTGTAGAT AAGACTGACG 
TTTGAAGAGA TTTTCGAAGA 
CGGTAGGGTA TTGGC AG CTT 
TAATTTCTTG GCATACTTAG 
GCGCTCCAGA TAATAGCCTC 
AACCCGTTCG ACAAAGGCAC 
CATAAAACTA AT C AAAG AG C 
CATCTCTGTA TTGGCAAGAT 
GCTTTGCTCC TTGTTCTTCC 
GAGAAAACGT AAAATCCTCA 
ACGATTGATT CCTTGTTCTA 
GCTAATGTTC CCCTCTTCAT 
GAGTTTCAAA GCAGCCCAGT 
TTCAATACGT CCATGATAAT 
TTCCAACTCT TTTTCCAACT 
ATCCTTGCTT TCTTCCTCTT 
GCTAACAAGC GTCACACCTG 
ACT T GGT ATC AAGAGAATCG 
GAGAAACATG TCCTTCATAT 
ATAATAGCCA TAGATCATAA 
AG AT AG C AAT AAAAGTAGAG 
TTCCAACTCC ATCCCTAAAA 
CCAGACCGTC TCGGTCACCG 
ACTTGCCTTT AAAATTCCAG 
TGCCAGAAAG AAGGTCACTA 



931 

GGGGAAAATC GAAATCAACC 
TGCTTGATAA TAATGCAAAA 
CTTTTTCATG TCATACTCCT 
CTAGAAAGCT AGCCGCAAGC 
AAGTCGATCA CATACATACG 
GTATTAACTA ATTTCTTCTT 
C CTTCAAGG A ATAGTTCTCT 
TCGTAATCAA TCGTTTTTCT 
TCAGCATTTC ATCGATATTG 
CACTGCTGAT AATAGCTGTT 
GTCTGTAGAC TCCCTTCAGG 
AGAGCTCTGC AATTTGGTCA 
AGCTACGGAA GGTCTTTCCG 
AGGAAACAAG AAAATAATAG 
TATTTTTCAG ATAACGTTGG 
AGGCCTGTTC CAAACCATCA 
CTTCTTGATC ATCCTGGTTT 
TGTCAATAGC CGCATAGAGG 
CTAGCGTTAC TTCATTCAAA 
C AT C AG AAAG ATGAGGCAAG 
CAACAAGGAA AAGCAAAAGA 
TAGCAATCGA CACCGTTCCC 
ACTTATTTAG CTTTTTCTTG 
TAAAACGAAT GACAAAAAGG 
GATTATAGAT TGGATTGGTC 
T C AC AAAG AC AGAACCGTTG 
TAT C C ACT T G GGCTTCGAGG 
CAACTACGAC GGCAATAATA 
GAGGCAAACT CAATTCTAAT 



AATAGTAGGC 60 

AAGCTTTTAA 12 0 

TCACTTATAA 180 

TGCTCAAAAC 24 0 

GTAAGGCGAC 300 

ACCAATTCCA 3 60 

AAGTTATTTA 42 0 

TCGTATTCGA 4 8 0 

TTGGGTTTGA 54 0 

TCTCGAAGAC 6 00 

TTTTCCAAAC 6 60 

TAATCAAGAG 72 0 

AGAGTAAAAA 780 

GTCAGTCTTG 840 

TAAACTCGGT 9 00 

CTTTCAATAC 960 

TCTTGGCTTA 102 0 

GGAAGTTTAT 10 80 

ATGGCGATAT 1140 

AC C AAG AG AC 1200 

GGATACTCCT 12 60 

TTAACACCTG 1320 

AGGCGTCGGG 13 80 

ACAAAGGTAA 1440 

AAGAT AGGTT 15 00 

AGCATAAAGG 15 60 

AGCGTGATTT 162 0 

CCTGAAACAT 1680 

AAAAGT T C AC 1740 
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TGGCAATATC 


CGTTGCGCGC 


ACACTTAGCA 


AGAAGGTATG 


GAGGAAGCGG 


TTGGTCATGG 


1800 


CTGTTAAAAA 


TCCAATTAAA 


AAACCGCCTA 


GGATTGAAAA 


GATGAGCGAA 


CTGCTAGCTT 


1860 


GC CC C AG AG A 


AAAAGCTCCA 


GTTGTCCAAG 


CTGTCAAAGC 


TACCTGAAAA 


GCCACCAAAC 


1920 


CAGAAGCATC 


ATTCAAGAGT 


CCTTCGCCCT 


TAAGAATATT 


GGACACGCGC 


TTAGGAAAGC 


1980 


TAAAACGCTC 


CGAAAGAGAG 


GCAAAGGCCA 


CCAAGTCCGT 


AGGACCAAGG 


GCTGCCCCAA 


2040 


CAGCCAAGCA 


AGCTGCCAAG 


GGAAGGCTGA 


ACCAAAGAAG 


ATGGGCCAAG 


CCACCCAAAC 


2100 


TCAGGGTCGA 


G AT AAAAAT C 


ACTGGAAATA 


TGAGATAAAC 


AATGATTCGC 


CAGTGTTTTA 


2160 


AAATAGCCGT 


AAC AT CTGCT 


TCTTCAGCCT 


CTCGGAAAAG 


CAAGGGTCCG 


ATAACCAGTG 


2220 


CCAAAAACAA 


CTCCGTATTA 


AGGTGAAAGT 


CAGTATTGGG 


TAAAAAGAGA 


CCAATCACAA 


2280 


TTCCCAAAAG 


AATTTGCACC 


AAAGGGAGAG 


GCAAAAAGGG 


CAGGAGCTTA 


TTGGTTGTAC 


2340 


TTGAGACAAT 


CAAAACCAGT 


AAAAAT AGGA 


TGAGGTAAAT 


CAGTAATTCC 


ACGCACGTCC 


2400 


TCCTTAATCT 


TT T T T AC AAC 


AGGATTCAAA 


TATCTCCTTC 


TGCTCTTTGA 


TTTTTTGGTC 


2460 


AATCTTGGAA 


CAGTCTTTGT 


GCTCAATTTT 


TCTCTGGCAC 


CGTTCCATTT 


CAAGAGCAAC 


2520 


TAATTTTTTC 


TTGATTTTAA 


GCATTTTTTT 


GCTCATATGC 


GCTTGGTCTA 


GCACGCCCAT 


2580 


CGCTCGTTCG 


TGGTGGGTTG 


ATT C AAC AAA 


ATTCTGGCGC 


ATGGCATCCA 


GCTTTTCGTG 


2640 


TAAGTATTGT 


TTATCCATGT 


CTGTATCTCT 


CTAATTTTTC 


AATCATCACT 


AAAAACGGCG 


2700 


GGTTGTTGAC 


TTGGTTTAAA 


GTTCGGTAAA 


TGGCAGCTGT 


GTACTCTTGT 


TGGTTCAACT 


2760 


GGATCACAAA 


AT C C AAG AC A 


GCATCTCTCT 


CGAGATCGCC 


TCCTTCATGA 


CCATAGTAAA 


2820 


TCATAATAGC 


AATTCGTCCA 


CCTTTGACAA 


GTAAGCCACA 


TAGCTTTTCT 


AATGCCTCAA 


2880 


TCGTTGTCTG 


CGGTCGGGTG 


ATGACAGACT 


TATCAGCTGC 


CGGCAAATAG 


C C C AG AT T AA 


2940 


AAATCCCTGC 


CTTAGCTTTT 


ATCACAAACT 


GGTCCAGTGT 


CTCATGGCCT 


TGCAAGATTA 


3000 


ACTGGGCATT 


TGTCAAGTCA 


GCCTGATGCA 


AACGCTCTTG 


GGTCTTTTCC 


AAGGCTTGCT 


3060 


TCTGAATATC 


AAAGG CAT AG 


ACTTGCTTGG 










CATTTCCCAT 


AGTCGCATCC 


ACTACGACAT 


CCTCTTTTGT 


CACGACCTCA 


GCCAAAAAAT 


3180 


CATGTGCCAT 


CTCAAGTGGT 


CTTTTCATTT 


TCAAACTCCT 


GTTTTACAGC 


CTTGCATCCT 


3240 


TGAACACTTC 


CACGACGTCG 


CATCTCCATC 


TCAATGCTGT 


TGAGGACTTC 


CCATTTATTG 


3300 


AGGCTCCACA 


TAGGACCAAG 


C AG CAT AT C C 


CTAGGCGCAT 


CTCCTGTAAT 


TCGATGGATG 


3360 


ACGATATGTT 


TGGGAATAAT 


TTCCAGTTGG 


T C AC AG AT G A 


CCCTGACATA 


TTCGTCCTGA 


3420 


CTCATCAATT 


GTAAACGCCC 


CTCATGGTAA 


TCTCGTTGCA 


TACGAGTATT 


TGTCATAAGA 


3480 


TGGAGCAAAT 


GCAGTTTAAT 


CCCTTGAATA 


TCGTTATCCG 


TGACACAACG 


GCGGACATTT 


3540 
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TCAACCATCA TCTCATGGGT TTCACCAGGC AAACCATTGA TCAAATGGGA AACAATCTCA 3 600 

ATTTTTGGAT ACTTTCTCAA ACGCTTGACC GTTTCCACCT ACAATTCATA AGAATGCGCA 3 660 

CGGTTAATCA GGTCAGAGGT TGCTTCATAA GTAGTTTGCA AGCCCAATTC AACCGTCACA 3 72 0 

TGCATGCACT CCGATAACTC AGCCAAATAT TCGATGGTTT CGTCTGGTAA ACAGTCTGGG 37 80 

CGCGTTCCAA T AT TG AT T C C TACCACACCT GGCTCATTGA TAGCCTGTTC ATAACGCTCT 3 840 

CGAATAACTT CCACCTTTTC ATGGGTGTTG GTAAAATTTT GAAAATAAAC C AG AT ACTT C 3 900 

CGAACATCCG GCCACTTGCG GTGCATAAAG TCAATTTCCT TATAAAATTG CTCACGGATA 3 960 

GGCGCATCCG GTGCCACAAT GGCATCTCCA GAACCAGAAA CCGTACAAAA AGTACAGCCC 402 0 

CCATGAGCCA CAGTCCCATC ACGATTGGGA CAATCAAATC CCGCATCAAT AGGGACTTTA 408 0 

AAAGTCTTTT CTCCAAAGAG TTTTCGATAA TAATCATTCA AGGTATTATA AGATTTCATG 4140 

ACTTTCATTA TAACAAAAAT CACCCACAAT CTCAAAAGCC TGACTTTCCT ATAAATTCCT 42 00 

CTGTTTCTCG TTTCCATTAG CCTTTTTTTA TGATACAATA TGGGTATGAT TTTAATGAAA 42 60 

TTAGCATCTA TTTTATTATT GATACTGACC TTAGTCGTCT GCATTATCCT AACCAAACTT 432 0 

TT T AG AT T AA AAAAACTAGG ACGAAACTTT GCGGATTTGG CTTTTCCAGT CTTGGTATTT 43 80 

GAGTATTACT TGATTACAGC TAAAACCTTT ACCCATAATT TCCTCCCTAG ACTGGGGCTA 444 0 

GCCCTCTCGA TCCTAGCCAT TATTCTCGTC TTTTTCTTCC TTTTGAAAAA ACGCAGCTTT 4 500 

TACTACCCTA AATTTATCAA ATTCTTCTGG CGTGCAGGAT TCTTATTAAC CCTTATCATG 456 0 

TATATAGAAA TGATTGTTGA ATTGTTCTTA ATGAAATAGT CGAATCCCTA AGCATTTTCT 4 62 0 

AGGGATTTTT GCTTTCTCTA CAAAATAGTA TAGACAATAA CACTATACAA TTTTATACAA 46 8 0 

AGAAAAGAGT CTGGGACAAT AGTCTCTTAT AT C C AAAAAG GCAACGGATT TGCCGTTGCT 474 0 

TTTTTGGATG GTTACGATAG TCTTGGTAAA ATAGAATTGC CCAATAAACC ATTTAGAAAG 4800 

GCTATCCCAT GCAT AT T C AC TATAACACAA ATCAAACAAC TTTACCACTA GAAATCAGTT 48 60 

CCTTCTTACC ACAAGATCAT CTCGTTTTTA CTATTGAAAA AGTGGTGAAT ACCTTGGAGG 492 0 

AACGTCACTT CTACACCTCC TATCATGCCT TTGATCGCCC GTCTTATCAC CCTAAAATGC 4980 

TTGTATCTAC TCTTCTATTT GCCTATTCAC AAGGGATTTT CTCTGGTCGA AAAATTGAAA 504 0 

AATGGAAGAG TTAGTGACCT TAGATTGTTT GTTTATTGAC AGAACTAAGA TTGAAGCCAA 5100 

TGCCAACAAG TATAGTTTTG TGTGGAAGAA AACGACAGAG AAATTCTCCG CCAAACTTCA 5160 

AGAACAGATA CAGGTCTATT TTCAAGAAGA AATCACTCCC CTTCTGATTA AAT ATG C CAT 522 0 

GTTTGATAAG AAACAAAAGA GAGGGTATAA AGAGTCAGCT AAAAAC TT AG CGAATTGGCA 52 80 
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CTATAATGAC AAGGAGGATA GCTACACACA TCCTGATGGC TGGTATTATC GTTTTCACCA 53 40 

TACCAAATAT CAGAAAACAC AGACAGACTT TCAACAAGAA ATCAAGGTTT ACTACGCCGA 54 00 

CGAACCTGAA TCAGCCCCTC AAAAGGG AC T GTATATGAAC GAACGCTATC AAAACTTGAA 54 60 

AGCTAAAGAA TGTCAGGCGC TTTTATCTCC CCAAGGTAGA CAGATTTTCG CTCAACGCAA 5520 

GATTGATGTG GAACCTGTCT TT GGGC AG AT AAAGGCTTCT TTGGGTTACA AGAGATGTAA 558 0 

TCTGAGAGGG AAGCGTCAAG TGAGAATTGA CATGGGATTG GTACTTATGG CCAATAACCT 5 64 0 

CCTAAAATAT AGTAAAATGA AATAAGAACA GGACAAATCG ATAAGGACAA TCAAATCGAT 5700 

TTCTAACAAT GTTTTAGAAG TAAAAGTGTA CTATTCTAGT TTCAATCTAC TATACAATAA 57 60 

GAGAATGACT CAAAATTAAA AAGCTAGAGT TCCACAATTG GAAATATCTA GCTTTTTTGT 5820 

GGTTGAGAAC TATTTTGTCT CAGGCTCTTT ATCTTCTATT TAGGACAAGA GTTTTTCTTT 588 0 

GGTCTTTAAT GATAAAGAAG GTATCAAAAT TTCTAGTCTT CTTTTTTACC TTTAGTAACT 5 94 0 

ACTAATCCTG CACTCAAACC T AG AAGAGT T AAACCTGCTG CTACTGCTGC TTGGCTTGCC 6000 

GC AC T AC C TG TACTTGGTAA CTGGGCTTTA TTAGTTTGAC TAGCTTCACT TGAATCAATT 60 60 

GGTTTTGTAT CTGCTTTTTC TGACACTTGT GGTTTTTTAG CTTCTTGAGC TACTGGTTTG 612 0 

GTTCCAACCA AGACGATGCG GTCTGTCGGA ACTTCTACCA CTTCACGGAG TTTTTCTTCC 6180 

TTACTTCCAT CAGGATTAAT CGCTGTAAAG ATACGTTCTT TTCCAACTTT TCCTTCTTGT 624 0 

TCTACACGAG TTTCACCTAG ATACAGTGTT GAATCTTTTT TCTCAACTGT CTTGTATGCC 63 00 

AAATCTTTTT CAACAAATTC GATTTTTGGA AGATCTTCTT GTACAGCAGC AACTGTCTTC 63 60 

TCAGAAACTG GTTTTTCCTT AGTCAAGTGG ATACGGTATT CCTTGACTTG TTTTCCACTT 642 0 

TCTGAAACGA GGCGAACAAG TACTGGAAAG CTATCTTCTC CACTATCTAC CACAGTTGAA 6480 

GCTACTTGAT TGTTTTCTTC AACTGAGACT TTTGGCCGTT GACCTTTATA GGTAATTTGA 6 54 0 

TAGTCTTGAC GATTTTCAGC GAAATCAGCA AGTTCTTTTC CATCTACAAG AATCTTTGAT 6 600 

TGAGTGCTTT CTTGAGGCAA TTCACTTGGT GCAAGGAAGG TCATCTCAAT CATCGCAACA 6660 

CCGCTCTTAT CTGCTTTACG CTCCATACGC CATCTCATAG CTTTGGCTTT GATAGCTTTA 672 0 

AATGTTACGT TGATTTCATC ACCAGCTGCA ATGTCTTTAT CCGCACGATA AGGAACAGCT 67 8 0 

TCCCAATTTT CTGGATTGTT GAATGGATGG TCTGCGTCGT AGGCTTGGTA GTTTGAATAG 6840 

TAGGTTGGCA CTTCAAACTC TGGACCGACA TAGCGTTCTA AAACGAGTTT AGATGGTGCA 6900 

TCCGTACCAC TATCTGCAAA GAACTGAACT TTTCCTTGTG TAACAGTCCG TTCTACAATC 696 0 

TT AC C ATT TT CACGGAAAAT CACACCCGCT GATACTTCTG GATTAGAAGA TGGTGTTGGT 702 0 

GACCAGTTTG TCCAACGACG ATTTTCTGAA TGATCTCCGT CATTGAGATA GTCAACGCGG 7080 
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TCATGAGAGT TTTTGTCAAT ATCATTGGTT GCTGAAGCAA AGGCCTGGTT ACTGTTTTCA 7140 

T C AT AGT TAG GGTTATCTGA AAGAGTCTCA CCAAGTTTGT CTGTCACTCG TACAGTGATC 72 00 

TCAGCAACAA GGTTACTACC AAGGACACGG CCTCGAACAG T AAATTG AC C TGCTTTTGTC 72 60 

AGATTTTCCG CTGGAACTTC TTCCCATTCA ACTGTCAGGT CTTTTGTTTC GTAGCCGTCT 7 32 0 

TTACCTGTGA AGTAAACTGG AACCTTAGTC GGCAATTCAA GTGCTTGACC TACTTGTAGC 7 3 80 

AAGCGAGCTT GTTTAACCGC AGCAACTGGT TTATGAGAAA GTAAGCTCTT ATCCTTAGTG 7440 

AAGTGCAGAC GGTATTCTCC TAAGATGTCG CCATTTTCAG CTTTCGCGAT GACACGAACT 7500 

GGCTCACCTT CACGAACGCT TGGAACGACG GTAGCGAGAC CATTGTTGCT AACACTTGCT 75 60 

GTGACTGCCG GAACTTTTCC AT C T AC AG AC TCAAGGTAGT AGTCTGTCAA ATCAGGGTTG 7 62 0 

AAGTTTGCTA AGTCTTTGCC GTCAACTTGG ATTCTTGTTT GTCCTTGCTT GGCTGCCGCA 7 680 

ACTTGTTTCG CAAAGATTTG TACCTCTGTG ATAGACGTTC CACGCTTGTT ATCTGCTTTA 7740 

ACCATGCGAA TACGAACAGC ATAGGTTTCA ACTTTATCAA AGCTAAAGTG GTTCATTTCT 7 800 

CCAGCCTTGA GTTGAGCAGG GGCTTTTAGA TTAGTAACTG GTTTCCAGTT GGCAGAATCA 78 60 

TTAAAGACAT GGTCCTCATT ACCAACAAAA CTAGGGTTTT TAGGAGCTGT TGGGACAGTC 7 92 0 

TTACCAACAT AATACTCAAT CACATAAGAC TTCGGTACAC CAACTCCATG GTCTTCATGG 7 980 

AATCCGACAC TTAGATTATC AACGGAGCGT TTGCTCAAGA TACCTGAATC TCCAAACAGA 8 04 0 

AC AC C G AC T G AAGCTTCTGG ATTAGTACGA TTCCAGTTTG TCCAACGATT GGCTGGTTGG 8100 

TTATTGTAGG AAATGAGCTT GTCATTAACA TTTGAAACTG GGTCGCTTGG ATTTGAGTCT 8160 

GAAGCAAAGG CAAGTGGCAA TTCTGAACCG GTCCATTGGT CAGAAATGTT TGCACCTTGC 82 2 0 

TCAGTTTGAG CAGATACGCG AACATGAAGT TTAGTTGTTA ATTGCGTACC TTCTAAGCGA 82 80 

CCATTAACTG TAAAGACACC TTCCTTAGCG TATTGCTCTG GACGAATCGC ATCCCATGCA 834 0 

ACCTTAGCTG ATGAAACGTG ACCATTTGAA TCATATGTCC GAACACTTTC TGGTAATTGT 8400 

GGTGCTTCTG CGATTGGAGT TGTCACACTG ACTTCTTCAA CTGAAACGAT ACCTTCTACA 84 60 

GAGACTTTTG CACGCGCTTC AAGGTCAATT CCTTCAACTT TACCTAGTAC TTCAAATGTT 852 0 

TGATAGGAGT CTAGTTTTTC TTTCGGAATA GCTTGCCAAG TGACTTTATG AGTTTTAGGG 8 5 80 

AAACCTTTGT CATACTCAAC TGTTACTGTT GCTGGAAGAC TTGGTTCCTG ATGCAAATCT 8 640 

GT CACT AC AT TTACAGGACG GATGGATTGC GCAATCTTCT TCTCAGTATT GGCTTGGATA 87 00 

GTGAGTTCAA CTTGGTCTTT AGCTCCCTCA TATTCAGCGT TCAGAGTGAC TGCTCCTGGC 87 60 

TTATGCAACT CAAGCATTCC TTTACGAATT GCGACTTCCC CTTCACCACT TGTAGAGAAG 882 0 
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GTTACTTTAT CAGCTGGTAA TACAGCTTGC GTTCCATCTT GATAGTGAGC TCGAACCGAC 8880 

AATTTGACAG TTTGGTCTTC TTTGAGACTG TCAGCTTTTT CCACTTGCAA GCTCAAGTGA 8940 

GCAATTTTTG GCGCTTCTTC AAGGAATTGA ATTGCATAGG TTTGAAGAGG GCCACCATCT 9000 

TTAGGCTGAA TAAAGATGCT CGCACGCATG CCGTTTGCTG CGCTTGCTTG AAGAACTGTA 906 0 

ACAGCTGCAT TTTTAGCACT TGCTGTGACT TCTGGCAACT TAGCTCCATA AGCAAGAGTG 9120 

CGGTATTGCA TTGGTTTTTG ACTAGTAAGA CCTGTTACTG CCTCACCACC AACCGTTACA 9180 

GTTGGTACTG CAGGTGCCGC AGGATTGCCT TCTTCTACCA CAAGGGTTGC ATGAATTGGT 924 0 

TGACCTTCTA AATAACCGGT CGCTTGAATA CGAGAACCTG GAATTGCTAA CTTAGCTTTA 9300 

TCTTCTTCGG CAATCTCCCA CTTGTCCACT TCATACTCTT CAACACTTCC AT C AAT C AAA 93 60 

ACATAGGAAA CAGATTTGTC TACAGAATTC AAGTCAGTAT TTGGAGCAAT ACGTTTCACA 942 0 

ACTGGTAGCT CTGATTTAAG AGCAATCACT TCTACACGAG CTTCTACTTC TCGTCCGTCA 94 80 

GCCATACCTT TCACCGTTAC AAT AC C AGG C TTGCTCACAT CTACTGAAGA CCAGGTTACA 9540 

GGACGTTCTG CACGGCTACC ATCACTGTAT ACAAACGGAA CAGTGGTAGG CATTTCAGGT 9 600 

GCCTCTCCAA TAATGGTCTG TACTTTTGGC ACTTCTGTCC CCAAAACAGT CTTCTCTTGT 9660 

CCTTCTTTCT TACCAGTAAA GACAGTGACT TGGTTCGATT TCAAGAGATC AGAGTGGGCA 972 0 

GTCAGGGTGA ATTTCCCTGC TTGTTCAGTT GATTTGACAA TGGCAACACC TTT AC CAT T A 97 8 0 

AATGCTTTAC GAATCCAAGA ACCATCTGCT TGCGCCTTAT AGCGTTCACG GCTGGCTTGT 9840 

TCTCCGTTAT CTACACCGAC CAGTTGACCT TGGCCATGCA ATTGGAAGCG AACCAGATTA 9 9 00 

TTAGCAGTTG GAACCACATT CCCCTGGCTG TCAACAATTT CATAGTAGAT GTAAGTCAAG 9 9 60 

TCTTTTCCAT CTGCTGCAAT CGCATGGTCT TCCTTAATAA GACGAACTGC CGCTGGCTTA 10 020 

CCAGCAGTCG TAATCTTATC TCGAGCAATT TCCTTGCCAG AT TC AT C AC G AGCAATTGCT 1008 0 

TCCAAGGTAC CTGGTTGATA GGCAACTTTC CATTCAAGAT AAAGTTCATT AGCATTTGCA 1014 0 

CCTTCTTGGT AAGTCCGCCC ATCGCTGGTT TGTTTTTTAT TGAAAGTCTT AAGACCAAGA 10200 

GATTTTCCAT TCAAGAACAA TT C T AC ACT A GAAGCATTCG AATAAGCACG AACTGGAATC 10260 

TTACCTTCTG AGTCAGCTAC TTTGGATGCT AATTCTTTGT TTTCCCAGTT CCAGTGAGGA 103 2 0 

AGAAGGTGTA CCATCGGTTT CTTCTTAACA GAAACCCATT GGCTTTGGTA GAGATAGAAG 103 8 0 

TCATGTTTTG GAATGCCGGC TGTATCTACG ATACCAAAGT AAGAGCTCTT AACAGGAGTT 10440 

TGATTTTGGT TGTGCCATGG TGTAGGTTCA CCAATATAGT CCGTACCTGT CCAGATAAAC 10500 

TGTCCAGCAT AGCCAGCGTT GTCACGGTCA AAAGTCCATG AAGCGGTTGC TGTTTTCCCC 105 60 

CAACCCACAC GATCATTTCC ATAATCTGAC TGTTCATAAT TACGCTCAGG TCCATTGCTA 10620 
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TGTTTCAATT CACGTTCAGG GCGATAGTAA CTTCCACGTG TACGGGTAGC TGAAGATGTT 10 6 80 

TCTGATCCAT AAATCAACCA TTTTGGATGC TTAGCTCTAA GGGCTTTGTA ATTATCTTCA 10740 

GAATAGTTAA ATCCAACAGC ATCGAGTTCA TCAGCAATTT TCTCATGCCC TCCGCTACCA 108 00 

TTACCGAAAC GGAATTTATC TGCT C C CAT G GTAACATAGC GAGTCTTATC AACATCCTTG 10 860 

ATAACCTTAA CCAAACGTTT AACAGTTGCT AAAGAGTGGG CATCACCATT AGCTTCACCT 10920 

AT T T C ATT AC CAATTGACCA CATGAAGATA GCAGGGTTGT TTTTGCCTCT TTCGACCATG 109 80 

GTACGTAGGT CAAAATCAGA CCATTTTTCA CCTTTTCGAG CTTCTGGGTG AGTGGCATCT 11040 

TTTTCAAAGA AACGTCCATA GTCATAAGGT TTCTTGCCAC CATACCACGT ATCAAAGGCC 11100 

TCTTCCTGAA CGAGTAAACC TAGTTCTGCT GCGATTTGCA AGGTTTGCTC ACTAGCAGGG 11160 

TTGTGGGTTG TACGGATGGA GTTAACTCCC ATCTCCTTCA TTTGTTTGAG ACGGCGATAT 1122 0 

TCTGCTTTAT AGTTTTCTTC TGCTCCAAGC GCCCCATGGT CGTGGTGCAA GGATACTCCA 112 8 0 

TGGAATTTAA TACGTTCACC ATTCAAAGAG AAACCTTCAT TTGGAGTCCA GTGATAGTAA 113 4 0 

CGGTAACCAA ACAAATCCTT CTTAGCATCA AC C AATTG AC CGTCACGGTA AACACGCGTA 11400 

ATCAATTCGT ACAAGGCAGG TTTGTCATTT AAAACAGTCC AGAGTTTTGG TCTTTCAACT 114 60 

TCTAAAATCG CATCTAGGCT TGTTGATTCA TGTGCTTTTA AGGTACGACT CGCTGTACGA 1152 0 

ACTAAGCCTG TTACAGCATG ACCACCTCGT TCAACGATTT GATATTCGGC TACAAGTTCA 11580 

TGGTCTTTGT CGTCCGTATT GACGATTTTG CTGGTCACAT GAGTTTCAAC CTTGCCATGT 11640 

TGTTGTTCTT CAAGTTTTGG TGTTAAAATA GTTGTCCCAT TTTTCTCAAC ATGCACCTTA 117 00 

TCTGTCACTT GTAAAGTCAC ATCACGATAG ATACCACTTC CTGAATACCA ACGGCTACTT 117 60 

GGCTGTTTGT TGACTGCATG GACAGCAATC ACATTCTCAC GACCATCTTT TTGAAGGTAT 11820 

TTGGTGATAT CATATGAGAA CTGGTTATAA CCATTTGGAT AATGCCCCAC TAACTGACCA 118 80 

TTGACATAAA CTTGAGAATC CATGTAGACG CCATCAAAAG TAAGGCGAAC ATTTTTCTTG 1194 0 

AGGTCTTTTT CATCTAGTTT GAAAGTCTTG CGATACCAAG CTTCCCCACC GTTGAGCTGT 12 000 

CCACCTTCAT TTTGTGCAGG AGATTCATGA TCGAAATCGT TAAAGATACT CCAGTCATAC 12 0 60 

GGTAAATCTA ATTTTTTCCA CGTAGATACG TCTGCATCAG GTTTAATGGC TTCCTTAGAA 1212 0 

TTTGCATTGA GTTTAAAGTA CCAATTTTGA TTAAAATCCA CTTTCCTGTC TTCAATCATT 12180 

TGATTCACTT CTTCATTTGT TACAGCTTTA GCATCTTCCT TGAGCGGTTT TTCTTGATTT 12240 

GAAGCTTGTG ATTCTATCCT TGGAGCTTTT TCTTCCGGTT TAGCAGACAC TTTTTCCTCT 12 300 

TTTGGAGTTA CGGCTTCATC TTCTTTCTTC TCAGATGCAA TAGCCTCAGT TGAACTAGGT 12 3 60 
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TCACTTTGTT CTGTCCTTTC AACTATATTT TTAGTTTCCA AAGCTTTATC AGCCTTTTCT 12 42 0 

TCTACTATCA TTTTTTCCTC TTTAGGTTTC TCAGCAGTAT GAGTAATAAG TGTTTCATCC 12480 

GCATAAACTA CAGATTCTCC AGCTATATTT CCTCCTAATA AAACTGCACA AGTCCCAATC 12 54 0 

ATTACTGAGC AAGCTCCCAC AGCAAACTTA CGAATGCTAT AAACTCTTTT CCGATTCCAA 12 6 00 

TGGCCTTTCC CCATAAAACC C T C CT T AT AT TATATTTAGT GCAGTTAGCT AC T AC C AAAG 12 6 60 

CCCAAGTGGT ATACATGGTA TGACAACCTA GTTTCAACAA TTTACACTCT GCGAAAATCC 1272 0 

AATTCAAACT TCGTCAGTGT CGCCTTGCCG TAGATATGAT TACTGACTTC GTCAGTTTCA 127 8 0 

T C T AC AAC CT CAAAACCATG TTTTGAGCTG ACTTCGTCAG TTTCATCTAC AACCTCAAAA 12 840 

CCATGTTTTG AGCTGACTTC GTCAGTTTCA TCTACAACCT CAAAACCATG TTTTGAGCTG 12 9 00 

ACTTCGTCAG TCTTATCTAC AACCTCAAAA CTGTGTTTTG AGCAACCTGC GGCTAGCTTC 12 9 60 

CTAGTTTGCT CTTTGATTTT CATTGAGTTT ATATTTTATA GGAGCGCATT ATTTTGCTTT 13 02 0 

TGCTGCGTAC TCTTCGTTAC GTT T GAT CAT TTGTTTTCTG TACCAAGCAA AGATACCGAT 13 0 80 

ATAGAATACA AGGAAGACTA CTGCACCAAG GATTGCTTTG AT AT C ACC AG TTGTAGTGTT 1314 0 

ACCAATTGTC CAACCAAGAA GTTTTTCGAT TGGTCCTTCA AGAGTAGAGT GAGTAATCAA 132 00 

TTGAGTTTGG CTCACACCTT CTGGGAAGGC ACCTACACCT TTAGCAAGTT CTGTTGCAAA 132 6 0 

TGGTGCAATA AGTGTACCTG AAAGAAGGAA GAGTGGCAAC AAGAGTGTTC CGAAGATAAT 13 320 

CATACGGAGC AATTTACCAC GAGTTACAAC CAAGAGAGCT GGAGTAACAC CCATAGCGAT 13 3 80 

GATACCTGCA AGTGGCAAGA T AC C AT TTCC AACTTTTGAA AGAAGCACTG CTTCAATCAA 1344 0 

CATGATTGGT GCAAGTACGT TGGCACAAGC CCAGATTTCA GCACGACCAG CGATGAATGG 13 500 

CCAGTCAAGA CCGATATTGA ATTTACGTCC TTGAAGACGT TTAGTAGCAA CGTTTGTAAT 13 560 

ACCTTGTGAT AGTGGTTCTA CGGCTGCGAT GAACCATGAA CCGATAAGTG AGAAGAGTTC 13 620 

CAAAGATACA CCGGCAGTCA AACCAAGAGA CAACCATCCT TTGATAACAA GACGCCATTT 13 6 80 

ATCTGCATCT GCAACACCTG CAATTGGATG TGGAGTTCCC ATAAT AC CG A TAACGATACC 13 740 

AAGGATGAAA CCGATGAAGA ATTTAGATCC CCAGAAACCG ATTTTCTTGT TCAATTTAGC 13 8 00 

AGC AT C AAAG TCATATTTAT CAAGGCCTGG GAAGAATTTT TCAAAAATCT TATCCAAAAC 13 860 

CATGATAACT GGGTTCATCA TGTAGTTCAT GTGAGTTGAT GTCATTGGTG ATGAACTTGG 13 92 0 

GGCGTTAAGA AGGTCATCAA ATGTAGGTTT CATCAAGTCA GAGTTGATAA TTTTCAACAC 13 980 

ACCGACAAGG ACGATAGCTG CTGTAGCAAT AAAGAGTGAA ACCCCTTGAC TC AC ACC AT T 14040 

GTTATCAGCA TACCATTTAA TCAAGAGACC TGTGATAGAC AAGTG C C AG A TATCAAAGAT 14100 

ATCGAC AT C A AGTGTATCTG TTTTCTTCAT AGCTAGCATC ACTATGTTGA CAATCAACAT 14160 
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GATGAGCAAG AAGTATAGTG TCCAAGCAGA ACCCCAAGTG ATTGTAGCAA GTGGTGCCCA 1422 0 

ACCAACGTCG GT AAT ACT C A ATTGG AT AC C AGTGTTTTCA ACGAATTTTG CTAGTGATGC 142 80 

TGAGAAAGCA GTGTTTAGCA TACCGATGAT AGCACCGATA CCTGTAAGAG CGATGGCAAG 14340 

TTTGATACCA CCTTCAAGCG CTTTGGAGAA TTTCACTCCA AAAAGTAAAG CCAATACTGT 14400 

CAAAATGATT AACATGATGA CAGGTCCACC CATTTCTAAG ATGGG AT T G A AAACCTTTCC 144 60 

GATTAGGTCA AAGATTGCAT CCATAACAGT TCCTCCCTTT TTGATGTTAT ATGAATGTTA 1452 0 

ACAAATTAGA ATTAGCTTAA TCCGTGTTCT TT AAT AG C T G CTTCAATATT GT C AAAT ACT 145 8 0 

GGAGCGCTCA TTGCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCGATAAC TGGG AT AC C T 14 640 

GGTTCAAAAC CAAGGTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC TTTCATAAGG 147 00 

TCTTCGTTTA CATCTTTCAC CATGACTGCA TCACAGTGAA CATCATAACC ACGGTTTGAA 147 60 

AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC ACCGCAGGCA 14 820 

GCAAGAATTT TAATCATTTA GATTTCCTCC GAT T TT AT T T T TT AAT AG AC AAGATTAAGC 14 8 80 

GGTTGCTTCA GCAATGTAAG TATAAAGGGC TTCTGGTTCA GAAATTTTTG ATAGGTCTTC 14 940 

AAGATGACCA TTTCCTGTGA AGAAGTCCAT TAACTGAGCA AG AAT GT T C G TTTGACTTGA 15000 

ACTTGAATTA TTAATGATAA AGAAGAGTAG GGATACTTCT ACTTCCTTAT CAGGAGCTAT 150 60 

CATATTGTGA AAAGTTATTG GTTTTTCTAA TCGAACAACC ACCACTTTCT CAGCTAGATT 1512 0 

ATGAACAATA TCTGTGTGAG GAATCGCTAC ATTTGGCAAG TCCTTTCCTA G AAAT T C C AT 15180 

ATCTAAACCA GTTGGAAATG ACTTTTCACG CGTGATCAAG GCTTCACGAT AAGTTGGAGT 15240 

GACAATTTCT CGTTCTTCCA ATAAAGTTGC AACCTGATCA AAGAGTTGTT C TTG ACT AT C 15300 

CGCTTCTAAG CAAAACACAA GGTTTTTGTC AAAGAAATAA TCTAATACCA TAAGTTTTTC 15 3 60 

CGG 15363 
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

TAAGACTATT TAATAGTGGA GT G AAAT AGG ATACGAACAA ATTGATTAGG AAAATCAAAT 60 

GAATT T AT AG AAATCTTTTA GCAGTTATGT TATCCTATTC TAGTTTCAAA AC GC TAT AG A 12 0 
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AGCAGCATTG TGCTAGTCkA GATTCAGTTT ACTATACTAA AACGAGTAGC TTGAAATCAA 180 

AAAACCCACC CTCACAGGCA GGTTTTATCT GT ATT ATT C A GCTAGATTAT GCTTTACCTT 24 0 

CTGAACCGAA TACGTCGATA CGTTCTTCAA CCGATGCTTG GATAGCTTTT ACACCGTCAG 300 

CCAAGAATTT ACGTGGGTCG AAGAGTTTTT TCTTGTCGTA TTCTGCTTCG TTTGCTTCGT 3 60 

AGTCACGAGC AAATTT AC G A GTTGCGTTAG CGAATGCGAT TTGGCATTCT GTGTTAACGT 42 0 

TAACTTTGGC AACACCAAGT TTGATAGCTG CTTGGATTTG CTCATCAGGA ATACCTGATC 4 80 

CACCGTGCAA TACGATTGGG AATCCTGGAA GAGCTTCTGT CAATTTTTGC AAGTGGTCAA 54 0 

GGTCAAGACC TTCCCAGTTT ACTGGGTAAG G AC C GT GG AT GTTACCGATA CCAGCTGCCA 6 00 

AGAAGTCGAT ACCAGTTTCA ACCATTGCTT TAGCGTCTTC GATTGGAGCC AATTCACCTT 6 60 

TACCGATGAT TCCATCTTCT TCACCACCGA TAGTACCAAC TTCAGCTTCT AC TG AG AT AC 720 

CTTTAGCGTG TGCTTTTTCA ACAACTTCTT TAGCCAATTT AAGGTTTTCT TCAACTGGAA 780 

GGTGTGAACC GTCAAACATG ATTGAAGTAT AACCAACTTC GATACACTCA AGTGCATCTT 840 

CGTAGTGACC GTGGTCAAGG TGGATAGCTA CTGGTACAGT GATACCCATT GATTCAACAA 9 00 

GGTTAGCGAT CAAGTTGCGA GCAACTTTGT AACCACCCAT GTATTTAGCA GCACCCATTG 9 60 

AAGTTTGGAT CAAAACTGGA GCTTTTTTAG CTTCTGCTGC GCGCAAGATA GCTTGAGTCC 102 0 

ACTCAAGGTT GTTTGTGTTA AATCCACCAA CTGCATAACC GTTGTCACGG GCTGCTTGGA 10 8 0 

CAAATTTTTC TGCTGAAACG ATTGCCATTT TATCAGGCCT CCTGTATATT TTTATGGGTC 1140 

ATCCCATTTA CATTGTTCAT TTTATCACTT TTTGCCAAAA AAATCTAGTT TTTCCCGCAG 12 00 

TTTCGATTGA TTTTCTTCTA ACTCCATCTA TGTAAACCCT TTCTCTCCCT AGTCTTGGAC 12 60 

GACTTTTGGA AAATCTATAA AGAAGGTTAA ACTATTCTCC TCCATCTCGA AACGATAAGC 13 2 0 

TAATTTTTCA TGTTCTAATA GACTCTTAAC C AC AAAG AG C CCCATACCAG ACCCCTTGAC 13 8 0 

CTTGCGACTG GCATTGTCAG AAAAAG ACT G GGCTAGTTTT TCTTGTTCCT CTGAGCTACA 1440 

GCTATTTTCG AT AAAAAGT T CTCCTTCTCT TTCTCCAATT CGAACTAAGC CACCTGGAAC 1500 

AGAGTGCTTA ATGGCATTGC TG AT GAG ATT AGAAAGAATC AACTTCATAA CTGATGGGTT 15 6 0 

TAGATAAGCC TGCTGATGGG TCAAACTATT GTCTATCTGG AGCTCTCTTT CCTTGGCTAG 162 0 

CAAGGCATAA TCTTTGACCA GATTTTGCGT CAT C TGG AGG AGGTCAATTG TTTCCCTATC 168 0 

ATCTCGCAAT TCCTGCACAG AAGAGAGGGA AAGTATCTGC AGAACATGGT GATTGAGTTC 174 0 

ATCCACAATC CCCAAGGCAA CTCCCAGATA CTGGTCTCTA TCCTTATAAC G AC CGAT AT T 1800 

CTCTCTCATA TTTTCGATTA GGATTTTCAA ACT AG C C AGC GGTGTTTTCA ATTCATGAGA 18 60 

AGCTCCTCGT AGGAATTCGA CCTTCATCTT CTCCAGCTGG AGAATGGCTT CATTCTTTTC 192 0 
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ATGCAAGTCC GCAATAACAG TCAAGAGATG CTGGTAGAGG CTATTGATTT GTTCCTTGAG 1980 

ATT ACC T AT C TCATCCTTAG AATCCACGCG CAATCGCACT TGGGAATCCA GGTCCATCAT 2040 

CCGACGGGTC ACCCGCTTGA TTTCCAAAAT CGGTGCAACA ATAGTCCGAG CGTAGATGTA 2100 

GGCCACCAAA AGGGAAATCA GAAAGGAGGC CAGCAAGGTA TAGGGAAGAA ACTGGAGACT 2160 

GATTTGCTCC GCTTCCTTTT GTAAATCCAT GGAAGCTAGA AACTGGAGAA TCATAGTACC 2 220 

ACCGTCTTGC GTTTTCACCT CGCGCTCCTC AATAAAGAGA GAGGTTGTCT GGCGGTCTGT 22 80 

GTCCAGAGGA AGACTGTCCT TGACTTCTAA CTTGTCCTCG GTCATCTCAC CTTTGACGGT 2340 

CCCCTTGATA T C ACT AG T C T GGGAATACAA GTCTAACACT TGCTCGATAC TCTGCCTATC 2 400 

TTTCCCTTCT AGGGACTGGG CAATGGCTGT TGCCTTTTGA CCAATGGTTT CCTGACGATG 2460 

AC TC AG AT AA GTCGAAGGAA AAAGAAAATA AATAGCTAAA TGAAGGCAGA TAACCAGAAC 2 52 0 

AC T AAAT AT C GAGAAGGTAT AGATAAATAT CTTTGCAAAT AAACCTGTTC GTTTCATTTT 2 580 

CGCTCCAATT TATAACCAAC ATTGCGCACA GTGAGGATAC AATCCAAGTC TAGCTTTTTC 2 640 

CGCAATTCCT TGATATAAAC ATCAATAACA CGGTCAAAGG GAACCTCATC TGTCGCTTTC 2 7 00 

CAGACGGCAT CGATAATCTG AGATCGAGTC AAGGCCCGGC CTTCATTTTT CACTAGATAG 27 60 

TCCAGAATTT CCAACTCTTT GGCATTGATA GGCACTTCTT G AC C TG C GAG GCTTGCACTG 2 82 0 

TAGCTTTCAA AGTCCACCTT GGTATCCTTG TAAGAAAAGA TTCGTCCTGT ATCGTAGTAG 2880 

CGCTTGAAAA TCGCGTCCAC CCTCACTTTT AAAAGGGAGA GGGAGAAAGG TTTTTCCAGA 2 94 0 

TAG C CAT C TG CCAAAGAGGC AAAGGCACTC ATCTTGTATT CCTCATCTTG AAAAGCTGTC 3 000 

AACATCAAGA CAGGAACCTG ACTGGTTTTA CGAATCTCAG CTAGGACTTC TAAGCCGTTG 3 060 

AGCTTGGGCA TCTGGATATC CAGTAAAACC AGGGCCACCT CATAGCTAGA AAATTGCTCC 312 0 

AGAGCTTCCT GACCGTCCGC TGCCTCAATA GTTTCATAGC CACAATCCGT CAAATAATCA 3180 

CTGACCCCCT CACGGATCAT CTCTTCATCT TCTACAATTA AAAT TTT CAT ACTTTAACTG 3240 

CTCTCTATTT TTTATTTTTC T T AG AAT AAA TACCTACCCT ATTTTCTATT ATAGTCTCTT 3 3 00 

GCTGGCCTTT TGTCTGCAAG CAACTGACCA CTAGATAAAA CGTTGTGAAA TTCCTTTCTC 3 360 

ATAAATTCCA TAACTTTAGT ATATTATATT TAAGCACTAA AGTACAAAGA AAGCAACTGA 3420 

AAGCAATGAT TTT C AC C AC T GCTTTCGGAT TTATTTTGAA TTGTTAAATA GCCATTCCTA 3 4 80 

TCCACTATTC TTGAATAGAA ACACAAGATG CAATCTTTAT TCTAGACTCA TTTTTTCAAA 354 0 

TTT ATT C ACC ATCCAGCAAG AGCTCTTTTG GTTGTTTTCT AAGG AG AT T G CTTGAAGCAA 3 600 

GCGCCATAAC GAGAACCACT AGAACCAAGG CAAGGACAAA AATGATGATA AAGTCTGATG 3 660 
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TCTGAATGGA AATGTCTAGG CTCGACAAGG TCTTGCTAAA GCCATCTACT TCTGCACCAC 3 72 0 

CACCAAGGTT AGAGGCTTGA GCCGCCTTAC TAGCCTGTTT GGCAACACCT GAAGTCACAT 3780 

TGGCAAGGAC AGTGTTTCCA ATTGCACGGG CAGTGTAATT AGCTAGGAAG T AAG C AG AAA 3 840 

CTAGAGCAGG GATAGCAATC AAGATAGATT CGGTGATGAA TTGACCCAAG ATACTTGCCT 3 900 

GCTTGAGGCC GATAGAGAGG AGAATTCCCA CTTCCTTGCG ACGGGCGTTG ATCCAAAGGC 3 960 

TG AG C AAG AG GGCAAGGAGG AGAACTGAGA AG CT C AAG C T ACCCCAGAAG AGGAGGTTGG 402 0 

CCATCTTGTA CATACCAGAG ATAGATTGCT CAAGAGCTGG GTAGTTAGAG GAGCTCTTGA 40 80 

CGAGTGTGTA GCTCTTCCAG TTGATACCAC TGATGCCATT CAACTCTTTC ATAACATCAT 4140 

CCAAGTTCTT GTCTGCTGTT ACAAAGAAGG TTGCGTCCCC ATAAATGGCT GTGTCTTCTG 42 00 

TGTATCCATA AAGTTTTGCA GCAGTGTGAA TGTCTGTAAT AGCTGTGTTT TCGTAAAGTT 42 60 

CTTGTGAGTA GGTTACTGCT GACTTATTAT GACCATCAAA GAGTCCCTTG ATTGTCACTT 43 2 0 

CAACTGTTTC CTTGGCTCCT TTTTCATTAT CTGCATCGTA GATATTAGAG TCCAGTTTAA 438 0 

CCTTGTCCCC TACTTTCCAG CCGTGTTTGG CTGCCAAGTC CTTGTGCAAG AGGATTTTAT 4440 

CCTTGTCGTC GTTGGTTAAG TGCTCTCCTT CG ACT AG T TT ATAAGAACCA GAGACAAACT 4500 

TGTCTTCTTT AGAGGAGTCA TTGACACCTG TAATCATCAA GCTACTTCCA AAACGCTTGG 4 5 60 

CACGATCAGC AGTGAGATTC TTCTTGGTTT CTGGCGTTTC AATCAGGTCA TATCCAGTCA 4 620 

AATCTCCGAT AGCGTTGATA CGTTTGACAT AAGACTCAAT GGCCTTGTTT TCGGTGATTT 4680 

TTTTGATGTC TTCACCCTTG AT ATT C C C AG CACCACGAGG CGTTCCTTGG TTGACGCGAC 474 0 

GATTGATTTG CATGGAGAAG CTATTGGTGA TATTTTTAAA GGTCTCCTGA GAAGCCTTGG 4 800 

CAGTAGCTCC CTTGATTGAC AAGCCGACCA AACTCAAGCT CGCCATGAGG AGAATAATCA 4 8 60 

GGAAGATGAC AATCGATTTG AAAAACTTCC TTGTAACATA GGCAAATGCG TTGTGTAACA 4 92 0 

TAGATTCCCT TTCTAGATTT TGTTTTAATC AT T C T AT T AA AATAAGCTCA AATT ATT T AC 4980 

TAGTATTGCG CGTTTCAGTC AGTTTCTTAT CCTTTAATTC AAGTGTAATA TCTGACGCTT 5040 

GTGCCACTTC TTTACTGTGA GTTACGACAA T C AC AC AT T T ACCTGTTTTC TGGGCAAGTG 5100 

ATTTGAGTAG TTCGACAATA TCTCCAGCAG TTTTAGGATC C AG AT TT C C T GTTGGCTCAT 5160 

CAGCTAGAAT AACTGGAGCT TCTGAGACCA AACTGCGAGC AATGGCAACA CGTTGCTGTT 522 0 

G AC C ACCTG A TAACTGGAGA ACATTCCGCT TGATCTGGCT T TC AT CC AAA CCAAGCTCAA 52 8 0 

GAAGTGTATT CTTGCTTGCC TTTTTGTTGA CCAATCGGAT ATTTTCCAGC GGAGAAAGAT 534 0 

AATCTATCAA GTTATAATTT TGAAAGACCA GGGAAATATG GTGCATGCGA TGGTAAGAAT 5400 

AGCCCTTCTT ACGAATATCC TCTCCTTGAA AAAGGATAGA ACCTTCAACA GGACTATCTA 5460 
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GACCAGCAAG TAGGGACAAG AGTGTGGATT TTCCTGCTCC TGACTCCCCA AT AAT AC TGT 5 520 

AAAATTTTCC GGGTTCAAAA T T AT AAT TG A T CTG AT AT AG GACTGCTTCA GCAGTATTCT 55 80 

TATAACGGTA GGTAACATCT TGT AAT TGT A ATAAAGTCAT GATTTCTCCT TCTTAACTAA 5 640 

TAG AT G AT AA AATTTCTTTC GGTGATTTTC TAAATAAGAA TAGGAAACAA AGGGCTACAG 57 00 

ATAAGCAACT AAGCAGAACT AGAAAAACAT AGGATTCTGC AAAAGATAAG ATGCTAGTTG 57 60 

ATAAACTGCT TGCTTTGGCT AGTGTATCTT GTAAGCTTGC CTGATCTCCA CTTGCTAGTA 582 0 

GAGTTTGGAG TAGGTAAGTT GTG AT TGCGT TTCCTGCAAC AAATGCTGGA AGCAAAGCTC 58 80 

CAAGAGATAC CAAAACTACC TCTAAACAGA ATTGTAGGAA GATCGAGCTC TTGCCTTTTC 5940 

CAAGTGCAAG TAAAATCCCC ACTTCATAGA CCCGTTCTCT CAACCAGAGA G AC AAAAC C A 6000 

GAATTAAGGC TCCAGCTCCT GCTATCAACA TCCCATAAAG GAAGATGGTC AGGAAGGTTT 6060 

GGAAAGTTGC AACTGAGTCT TTGATTTGTT CAAAAGCCTT GTTTTCCTTT TCGACTTGGT 612 0 

AGCCTTGATT TTCCAAGGCC AAGTTTTCTA CCTGCTTCAT GAGTCCGTCC ATTTCCTTAG 6180 

G AT TTTC T AC ATAGAAGCGT GCTGCACTGA CTTGAGCTTC ACT AT T GC C C AAAAGGGTTT 62 40 

GGCTACTTTC ATAGTCTGTA AAG AC T TG AT TTTCACTGAA GTCAGAAGAC AAGCCTGTGA 63 00 

ATTTCTCTTG TTTTTTACCA GAAAAGATGC CG AT AAT CT C AAACTCTACT GTTTGTCCTT 63 60 

TTCCAGATTC AGACTGACCA GCATCCAAGC CAATCTTGTC ATGAAGCGAA AGACCGTTCT 64 20 

TCTTAGCCAA TTCTTCGTGG AT AAGG AT TT TCTTGGAATC CCCTTTTTGA AGGTGTCGCC 64 8 0 

CTTCTTTTAG ATTGAAAGCC GAACTGGTAA AGGTTACATC CTTGGATGAA T C C T C AAG AG 654 0 

CCGTTAAGCT AACCAAGTTA TTGTCTGCAG CTG AT AAAT C ATCACGCTCC ACGCTCTGCT 6600 

CGCCAGTCAC TGCTTCCTTG TCTTTTAGTT TTGCGACCGT CTCAAGTTCA GGAGAGACAT 6660 

TTTCCAGCCC CTTAATCTTG CTTACAGATG CTAGGTCTGA CAACTTGAAT GTCTGACCAT 672 0 

TCTCTATCTT CT T AAT AG AA AAAGATGTAT TGAGTGATTT ATAAAGATTG CTTTCTAC^G 67 80 

TTTTGTTGGA CTTCATCAGA GTCAAACAGG CTGAAATTCC G G C C AAT AAG AC C AAT AAAA 6840 

TCAGAAATAA AATAAAACTT CTCAGTCGCT TTCTGCTGAC ATAAGCCCAA GATCTTTGGA 6 900 

TTGGATTCAT TTGTCACCTC CAT ATT TGT A AGACTATTAT AAAAC CC AAA TATGAAATAT 6960 

TTATGAAATA CGAAAAAAAA ATATCGAGTA GGGGATAATC TCTAGCCCCT CTCACACCAC 7 02 0 

CATACGTGCC GTTCGGCATA CGGCGGTTCA ACTAACTTTT AACGCATGTC GTTCAAGGTA 7 080 

ATAATCCAAA CACGAAACCA GTCCACGTTT TTCAAGGACT GGTTTTGATA TAGCACGTTT 7140 

AAGTACCGAC TTCTGAGCTA CTATAGTAGA TTGAAACTAG AATAGTACAC CTCTACTTCT 7200 
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AAAAT ATTGT TAGAAATCGA TTTGACTGTC CTGAACAATT CGTCCTATTC TTATTTCATT 72 60 

TTACTATAAT TGATAGTGGT CGCCCCAGCC AGATACCTTA TCTGCTATCC ATTTAGGAAC 7320 

CCCTAACTTA AGCAATCCCC ATAATCGTCT CGATTTCTTC TTCCATTGCT TCCAGATAAT 7 380 

CACTCGTAGG CGAGTACGCA AGCGCTCATC TATGCTAGTG ACTATACTTT TCATATTTAT 744 0 

AATTCATTCC TTTCGTTTCA CTCAAGGCAC AACACAGAAT GAAAAAGTGT TGTGATCTTT 7500 

ATTTTGTTTT ATAATAATAG TGAGAAAACC TATCACTACT ACAAATCACG GGGAGGTGAA 7560 

TAAGTGAGTG GTACAGCCAC TACCTCGCAT ATTTTGTCAC ATCATTTAAC GGTACATAAT 7 620 

AAGTTGTACC ATCTGAATAA GTTGCTACAA TATCATTTGC ATGCTCTCCT TCACCTTTAG 7 680 

CAAAGGTTGG AGCTCCTGCT GGATGATTTT TATTTGCCTC TTTCAATTTT TCAATAATGG 774 0 

CATTTTTTCT GTATCTTTTA TATTATCAGG ATTTTTCACT AAGATTTTGT CTGGATATGT 7 8 00 

CGGTTTAGCA GAAACAATTT TTACTGTTAC TTCTTTTTTA TTCGAAGCAC TTGTCCAGTT 78 6 0 

TCCAGCATTA TCTTTAGCAT TTAATTTTAC AGTAATTCCT GAACTAGGAA CTTCAGTAGC 7 92 0 

AGGTTGATTA T C AAC ATT AT TCAACTTTAA TTTCAAAAGA GCTGTTGCAT CAGACGTTTT 7 9 80 

ATCAATCGTT ATATATAATG ATGAATTGTT ATTATAAACA GTTCCTTCAT ATTTAGCTGT 8 04 0 

TTGTGAGCTA CTTGAAACAG AACTGAAATT ATACCCACTA CCTCCCTGAT TATCTTCAAT 8100 

GCTTACGTCT AAATGAACTT CCCCACTATT ATTTGGCTTA GCAACAACTG TTATAGTAAA 8160 

ATAACATAAA ATTTGCATAA ATAGATTAGG GAAATCAAAG CAGCTTCTAG GAATGTTTTA 822 0 

GCAGTCACAG TGTACTTTCC CAGCATCAAG CC ACT AT AAC TCTG C AC AT A AAAATGGAGA 82 80 

AGATGGCAAT CCTCTTCTCC AAAT AT T AAC TTCTTTACAA ACCAACTATA GTTGACAAAG 83 40 

AACCTAAAAT CAATTGATAA CACAAGGTCA GGTCGGTCAA CTCTTTCAAC TGAAGCCCTG 84 00 

TCAACTCTTC CCATTTATCA ATCTTGTATT GGAGAGAATT GCGGTGCAGA TAGAGTTGCT 84 60 

GGGCTGTTTT AGTGAGAACA GCACTATTTT CCCAAAGAGA GAGAATGATT TCCTGAATCT 8520 

GATCTTGATC C AAAAT CATC TGGTGTAGAC ATTCCTTGAT TGGCTTCAAG TCCACGAGTC 8 5 80 

TTTCTCCCAT ACTCCAAAGA TAGAGCTGAG AAAAAGTATG AACACCTTGG TGACCCTGAC 8 640 

GCCACCATGT CTTGAACAAA TCCCGCTCAG CTTTGATTAA GTCTGATAGG GCTTGATGTC 87 00 

CCGTCTGAGA CCAAACCTGA CCCAACATGA TAGAAAGACG AAGTCCAAAG TCATACTCAA 8760 

CCGCTTCAAT CGTATCACTT AAAAT AT C TC TTACAGAAGT GTATTTGTCT TGTTGAAGCA 882 0 

CGAAAACATA ATCCTGAGCT CCGACCTGTA GCACTGTCTG ACAATTCGGA AAAAGAGTCC 8880 

GCATCATATC TAGCCAAGAA GCCAGATTTT CCTGCTGAAA ATAAGAAAGA TGGCAATAAA 8 94 0 

CCAACTGAAT CTTTTTAAAA ACTTGCGGTG CCTGTCCCTT GCCCTCAACC AGATAGGAAT 9000 
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ACCAAGGGTT TAGCGAACGA GCCTGCTCCT GCTGGGTCAA AAGGGCAACC AACTGCTTTT 9060 

CACGCTCGCT GAGCCCAGCT TCCTCCAGCA AAATCCACTG CTGAGAAGCT AAAGGG AG C G 9120 

T GAG AT AG C C CTCTTTCTCT ACTGGTTGGT CTGAAATCCG AGCCTCAGGA AACCAGTCTT 9180 

GTAGTTCTTT TGCCCTCATG TTCTAGCCCT CCACTTTTTG G AT GC AC C AT GAAACCAAAC 9240 

TCTCAAGACG TT C C AG ATT C TCAGTCATAT GGAGATAGCC CATAACCGCT TCAAATCCCG 93 00 

TGGACATACG ATAAGTCACG ACATCTGCAT TTTTAGCCTT TGTGTGGCTA TTGGTATTGC 93 60 

GGCCACGTTT GTAGATTTCT TCTTCTTTTT CCGTTAGGAC CTGCTCCTCC AACATGAGAG 9420 

CAATCAGGCG AGCCTGAGCC TTGGCTGACA CGTACTTAGT TGCTTCTTGA TGGAGTTTAT 9480 

TGGGTTTGGT CATACCTTTG AGGATGAGGT GACGGCGAAT ATACATAGAA TACACCGCAT 9540 

CCCCCTCAAA GGCTAGCGCA ATCCCGTTAA TGAGATTGAC ATCAATCACG TGTCCACCTC 9600 

ACTCCATCCT TGGTATCAAG GAGCTTAATT CCTTGAGTAA CCAATTGGTC ACGGATTTGG 9660 

TCTGCTGTCG CAAAGTCACG ATTGGC AC GC GCCTCTTGGC GTTTTTGAAT CAAGTCTTCA 972 0 

ATCTCTGCAT CCAAAACTTC CTCAACAAAG ACAATTCCAA AAATTTCTAA CAT AT C T G C A 97 8 0 

AGAGCTTGCT TGACACTTGC ATCATAGTTC CCTGAGTTGA TCCATTTGGC CATTTCAAAG 9840 

ACAACTGTGA TACCGTTGGC AGCATTAAAA TCTTCATCCA TAGCTGCTAC AAACTTATCT 9 900 

TTAAAGTTTT GTAACTCTTG G GC AT C C AC A TTTCCTGTAA ATGGTTGTTC GTAAGTATTC 9960 

TTGAGATACT TGAGATTGGT CTCGGCATCG CGAACTGCCT TTTCCGTGAA GTTGATAGGC 10020 

TTACGGTAGT GCTGGGTCGC AAAGAAGAAA CGAAGTACTT GCCCATCAAG AGTTTTAAGG 10080 

GCATCGTGTA CCGTAATGAA GTTACCCAAG G AC TT AG AC A T T T TG AC AT T GTCGATATTG 1014 0 

ACAAAGCCAT T GTGC AT CCA GTAGTTAGCA AAAGCCTTGC CTGTTTTAGC TTCAGACTGG 10200 

GCAATTTCAT TGGTGTGGTG TGGAAACTCT AGGTCAGCTC CACCACCGTG GATATCAATG 10260 

GTATCACCTA AAATCTCTGT CGACATGACT GAACACTCAA TATGCCAACC CGGACGTCCA 10320 

GGTCCCCAAG GACTATCCCA AGAAATCTCA CCTGGTTTGG AAGATTTCCA TAGAGCAAAG 103 80 

TCTACAGGAT TTTCCTTACG AGCCGTTTCT TCATCGGTAC GACCTGAAGC ACCTAGCTCC 10440 

AAATCTTCCA AGGTTTTATT AGCCAATTTA GCATAGTTGT GGGATTTTTC TACACGGAAA 10500 

TAGACATCCC CTTGACTCTC ATAGGCAAAG CCTTTCTCGA TCAAGTCTTC CACAAAACGG 10560 

ATGATGTCTG CCATAAACTC CACTACACGC GGATGGCGAG TCGCAGGTTT CACGCCCAAT 10620 

GCCGTCACAT CCTCACGAAA GGCAGCGATG TACTTATCCG CAACCTCCTG AGGCGTGATA 106 80 

CCTTCTTCCC TGGCACGGTT GATAATCTTA TCATCCACAT CTGTAAAATT GGAAATATAG 10740 
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GCAACCTTAT ACCCACGGTA CTCAAAATAG CGACGAATCG TATCAAAAGC TACCGTCGAA 108 00 

CGGGCGTTTC CTACGTGGAT ATAGTTGTAC ACCGTTGGCC CACAAACATA CATCTTGATC 10 8 60 

TTGCCGTCCT CAATCGGGAC AAATTCTCGC AAATCACGAG ACATGGTGTC ATAGATTTTA 109 20 

ATCATAAATC ATAATCAGGA AAGCTGAAAT CCAAGAACAA TTAGTTTCAT CACTAAAAGT 1098 0 

TCAAGTAAAT TTCAGTCCGA ATATCTCTAC ACTTCGGAAT CCCTTGCTCC TTTCTCATTC 11040 

AGATAAACCA CCTGAGTCTG TTTGACAAAG CCAATTTTTT CATACAAACG TTTGGCACCT 11100 

ACATTGCTAT CTTCCACTGC AATCTGAAAT TCCTTGTCAT TTTGCTCAAT TAGTTGGTTG 11160 

ACGAGGGATT TTGCTAAGTA GCTTCCATAG CCTTTTCCAC GTTCAGGTTC CAATATTGCT 1122 0 

AAACCGTAGA GGTAATTCGT ATTAGTCGAT AAATCAACCG TACAAGTTCC AATAACCTGA 112 80 

CCAGCTTTTA ATAAAATATA TAGTCGGCTT TCTGGATCTT TCAGAGCTTC AGCGACATAT 113 40 

CTATCCACAA CTTCTCTCGA TTCATGTTCC TCTGAAAATG CCTGAAATTT TAATTGACTA 114 00 

ATTTGATCCT GATACGAACT ATCTGCTAAC AAAACTTCAA GATGGGAAAC ATTTGCTAAC 114 6 0 

GGATAAGGTC TTCTATCCTT ACCTAACCAA GTTTCTGTCT CTTCATCCTC GATTAGTCCC 11520 

CAGTTACTGG CAAAGTCAGG ATGATTCTCT AAAAAAATAC GTTCTGTCTG AAAAGTGACT 1158 0 

GACCGAATGG GGAAAGAAGC TGTTTCTCTC TCAAAACTAG TAAACAATGC ACGCGCAATC 11640 

CCCTGACGGC GATGACCTGG ATGAACCAGT ATCGTCACTT CTACATCTTG GTCATCTGCA 117 00 

TAGACAGTTA ATAAACCAAC AAGTTCGCCT TTTTCATAAT AAAGGAAAAA GGCGGGCATG 117 60 

TTTGGGTCAA AAT TAAGC AT GTTAGAGAGA TAGGGATCGC GATAGGTACC GTCATAGTTT 1182 0 

TGGCAACAGT TAATTACTTT TTTCGCCTCA GATAGCTCCT CTTGGCTTAA CTTGTTTCTT 118 80 

GCTTGAATCA TATAGGTATC CTCTACAAAC CAGACGATCT GTGACTGGCA TCTTTAGCCT 11940 

GCTCGAGTTT AT TG AC AT AA TACTCTCGTT TTTCTTCGAC TTCGTGAATG ACAGGCTCAT 12 000 

CTTTCTTACC ATGAAGACGG ACAATCTTGG CCGGAATACC GACAACCGTC ACGTCACTAG 12 060 

GTACATCTGC TACGACAACT GCTGCAGCAC CGACCTTGGC ATTTTCACCA ATTTCCACAG 1212 0 

GCCCGATAAC TTGGGCATGG GCTGATATGA GGGCTCCCTT TCGTACAGTC GGATGGCGTT 12180 

TGCCACAGTC TTTCCCTGTT CCCCCGAGAG TCACTCCGTG ATAGAGAAGA ACGCCTTTTT 12240 

CAACAATCGC TGTCTCTCCA AT C ACC AG AC C AG AACC AT G GTCAATAAAA ACACCTGAAT 12 3 00 

CAATCTGGGC TCCTGGATGA ATCTCAATCT GAGTCCAAAA GCGCCAAAAC TGACTGTACA 123 60 

TACGAGCTAA TAGTTTGAAG CCGTGCTTCC AGAGAAAATG CGAGAGACGG TGGGCCGCCA 1242 0 

AGGCCTTGAC ACCTGGATAA GTCAGCAAAA CCTCCAAAGT GGTGCGGGCC GC TGG AT CAT 12 4 80 

TTTCTTTTAC AATATCAATG GTTTCGCGCC ACCACCCCAT ACATTTCTCC TTTTCTTATT 12 540 
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CTGAATCTTT TGATGTTTCT GTAAATTCTT TCTTAGGTTT GTAATCCTTT TGATGACGTG 12 6 00 

GGCGGTGAGG GCGCTCAGAC TTTTCACCTT TTTCATCATG CTCAGGTTTT GGCGGACGAG 12 660 

GTAGAAGAGC CTTCATAGAG GC AT C GAT AC GGCCTTTTTC ATCAATTTTG ATAACCTTAA 12 72 0 

CATCAACTTC ATCCCCGATT TCTACCAAAT CCTCTACACG ATTGGTACGA GTCCAAGCCA 12 780 

TCTCAGAGAT ATGAACAAGG GCATCTGTCT TATCAAAGAG GTTAACAAAG GCACCAAATT 12 840 

TCTCGATACG AACGACTTTA GCACGGTAAA CTTCATCCAC TTTGGCTTCA CGAACCAAAC 12 9 00 

CAGCAATAAT TTCTTTGGCA CGGTTAATAG CATCTTGGTC ACTAGAGTAG ATAGACACAT 12960 

TTCCTTCTTC GTCTATATCA ATCTTAACAC CTGTTTCAGC GATAATCTTG TCGATGGTTT 13 02 0 

CTCCACCCTT ACCGATGACA ATCTTAATCT TGTCCACATC AATCTTGATC GTATCAATTT 13 080 

TCGGAGCAGT TGGAGCCAAT TCTGGACGAA CTTCTGGAAT GGTTGCTTCA ATGACATCAA 13140 

GG ATT TC AAA ACGCGCTTTC TTGGCTTGAG CAAGAGCCTC CGTCAAGATT TCTGCAGTAA 13200 

TCCCTTGAAT CTTGATATCC ATTTGAAGGG CTGTAATCCC ATCACGAGTA CCTGCAACCT 13 2 60 

TGAAGTCCAT ATCTCCAAAG TGATCTTCCA AACCTTGGAT ATCTGTCAAT ACTGTGTAGT 13320 

TATTTCCATC TGAGATAAGC C C C AT AG C AA TACCAGCTAC TGGCGCCTTG ATTGGCACAC 13 3 80 

CACCAGCCAT AAGGGCAAGA GTTCCCGCAC AGATAGAAGC TTGAGATGAA GAACCGTTTG 13440 

ATTCCAAAAC TTCTGCTACT AGACGGATAG CGTAGGGGAA TTCTTCCAAG CTTGGCAAGA 13500 

CTTGAGCAAG AGCACGCTCA CCAAGGGCAC CGTGACCGAT TTCACGACGA CCTGGCGCAC 13 5 60 

CGTAACGACC TGTTTCCCCT ACAGAATATT GAGGGAAGTT ATAGTGGTGC ATAAAGCGTT 13 62 0 

TCTTGTACTC TGGATCCAAA C CAT C AATG A TTTGAGTTTC TCCCATCGGA GCCAAGGTCA 13 680 

AGACTGAAAG AGCTTGAGTT TGCCCACGAG T AAAG AG AC C TGAACCATGT ACACGAGGAA 13740 

GGAAGTCAAC AACCGCATCC AAAGGACGGA TTTCATCGAC CTTACGACCA TCAGGACGCA 13 800 

CCTTGTCTTC TGTAATTAAA CGTCGCACTT CTGCGTGTTC CATTTGTTCC AAGATTTCAG 13860 

CCACATCACG CATAATACGG TCAAATTCTT CGTGGTCCGC ATATTTTTCT TCGTAAACGG 13 92 0 

CAGTCACTTG GTCTTTCACT ACTTGAGTCG CAGCTTCACG GGCCAATTTC TCTTCTACTT 13980 

GAACTGCCTT TTGGAGGTCA CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT 14 04 0 

CCACGTGAAG CAATTCCACT TCTGCTTTTT CTTTACCGAC AGC AG C AACG ATTTCTTCTT 14100 

GGAAGGCAAT CAATTCTTTG ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT 14160 

CTTCTGACAA TTCTTTGGCA CCAGACTCTA CCATGTTGAT AGCGTGCTTG GTTCCAGCTA 1422 0 

CTGTCAATTC AAGAAGAGAT TGCTCTGCTT GTTCTTGACT TGGGTTGATG ATGATTTGGC 142 80 
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CAT C T AC AT A TCCCACTTGT ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG 14340 

ACAGTGCCAA AGATGAACCA AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG 14400 

AAAGCACTGT ATTGATGACT TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA 144 60 

TCGGACGGTC AATCAAACGC GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT 14520 

TCATAAAGCC ACCAGGAAAC TTCCCAGCCG CAT AC AT T T T TTCTTCGTAG TTGACTTGGA 14580 

GTGGGAAGAA ATCCCCAGTT GCCATTTTCT TAGACATAAC GGCAGCAGTC AAGACAGTTG 14 640 

ACTCACCGTA ACGTACGACA ACAGATCCAT TTGCTTGCTT AGCAACCTGA CCAGTCTCTA 147 00 

CAATTAACTC ACGACCCGCA AAAGTCGTTT GAAACACTTG TTTTGCCATT TTAATCCCCT 147 60 

TTGGATTGAT GAAATTATAC GCCTTGCCTA CAAAGATCAA GATACCAAGG ACGTCAAAAG 14 82 0 

CAAAGTAAAA ATAGGAAACT GACGAAGTCT TCGATGAAGA CAAGACAGTT TATCTTTTTT 14 880 

ACACAGCTTT TCGGCCGTGT T C AATT AC AC AAGATATTTT GGACGGTTCG GCTTGCCGAA 14 94 0 

CATTTCTGTA GAAAAATAGG AAGGTGACGT CGCACTCGAC GAGTGCTAGG AAGCTTATCT 15000 

TTTTTCCTAA G AAAT GAG AC CAAAATTCAA GTCATCAAGA TACCAAGCCG TCAAGCAACT 150 6 0 

CAAAGGAAGA TAGGAAATCG AACGACGGAG CGACTACTCC TAGGGAGATT TATCTTTTTC 1512 0 

CACAGAGTTG TAGGCAAGTT CAGTTTTCAA GATACATCAT TAGAAAGGTT T AAT AC T AAA 15180 

GTATCTAAAG CTTTCACGCT AATCGCTATC GGGCGATTAG CTAAATGCTT TACTAACTCT 152 4 0 

CTCGTCAAAT AACATCGATT TGACTCACTC GTGTCGTTAA ATCTTACAGT TTAAATGCAT 153 00 

TGTATTATTT AATACCTTCA TCTTTGTATC AAGTACGTAC AGAATTTATT T TAT CAT AT T 153 6 0 

TTTCTTAAAA AGTGAGGTCT TTACCATTAA AAAGGAACCA TTCCCCTCAC CTGAGAAGAA 15420 

TGGTTTGCTT TTATTATCCT AGAGACTGGT GATTAAACAA GGCATGGGTT GCTTGATGGA 154 80 

TGTATTTTGC TGTATCAGCA TTATTCATCG TAT AG AG AT G CACACCGGCA ACATCCTGAG 1554 0 

TTACCAAGTC CACGATTTGG TCCACTGCAT AGGCAAGTCC TGCTGCTCTG AG CG ACT C AG 15 6 00 

GGTCATGCTC ATACTTGTCT AAGATGGCTT TAAATTTGCG TGGAAGATGG ATATTCTCAC 15660 

AAGTCTTCAA GAGTCGGAGA GCCTGATTTC GAT T C AG AAT TGGCATAATT CCTGCATGAA 15720 

TGGGAACATC AATCCCAGCC AAGATACACT TGTCCTGAAA AT C AT AG AAG CGCTCATTGT 15780 

CAAAGAAGAG CTGAGTTACG AGGCTCGAAC AGCCTGCATC CACTTTCTTC TTAAGATTTT 15 840 

GAATATCTGA AATCTGATTT GGCGAATCTG GATGCCCTTC T GG AT AG C AA GC T C C AAT AA 15 900 

TATCAAAGTG AGGGGTTTGT TCCTTGATAA ACTCAATCAA GTCGGTTGCA TAGCGGAAAT 15 9 60 

CCTTTTGTGG TTCCACGTCT GGAATAATAT CCCCACGAAG AGCCAAGATT TTCTGCACCC 1602 0 

CAACTTTGTC CAAGTCAGCA ATAGTTTCAG CAACCTTGTC CTTAGTTAGA TAAATAGCTG 160 8 0 
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GCAAGTGGGC AATGGTCGGA ATCGCCAAAT CATTTTGGAT AAAGTCAGCC AAACGAACCG 16140 

TCGTTTCCTT GATATTAAAT T T ATT AT T GC TGGCAGTTAC ACT G AT AAAA TGGGGAGCCA 162 00 

ACTCCTGCAT ATCCTGCAAG GCTGAAATAA TGTTATCATT ACCCACGGCT GGGTTTGGAG 162 6 0 

GGAACACTTC AAATGAGAGT GACGGTGTTT GGCGTGACAT ATGTAATAAC CTTTTCTAGT 16320 

TGATTTCTTT TTGAACAACC ACTGTATGGA GAGAAATCCA ATCTTACAAT TTCTCACGCG 16380 

CAGCTTTAGC TGCTTCAACA AGGCGGATCA AGCTTTCTTT TGTTTCTGGG ATACCACGTG 1644 0 

TTTTCAAACC ACAGTCAGGG TTGATCCAAA CTTTCTTGCT TGGCACTTTA GCAAGGATGG 16500 

CTTCGATTGT GTTGTCGATT TCGCCTTCAT TTGGTACACG AGGTGAGTGG ATATCGTAAA 16560 

CCCCAGGTCC CACTTCTGTT TGGAAGTTTT TCGCTTTGAG TTCGTCCAAG ATTTCAAGGT 16620 

TTGAACGGTT AGCTTCAAAG GAAATAACGT CTGCATCCAT GTTATCGATA GCTGGGATGA 16 680 

TAT C T GT AAA TTCTGAGTAA CACATGTGAG TGTGGATTTG TGTGTCTGGC GCTACTGTTG 1674 0 

AGTGTACCAA GCGGAAGGCA GGAATAGCCC AGTCAAGGTA GTCTTCGTAC CAGTCGCTAC 168 00 

GGCGGAGTGG CAATTTTTCA CGAAGAGCAG CCTCGTCGAT TTGGATGATT TTCACACCAG 16860 

CAGCTTCAAG GTCAAGTACT TCATCCTTGA TAGCAAGGGC GATTTGGAGA GTTGAATCCT 16 920 

TGATAGAGAT GTCTTCACGT GGGAATGACC AGTTAAGGAT GGTAACAGGT CCAGTCAACA 16980 

TACCTTTAAC AGGTTTGTTT GTACGACTTT GTGC AT AG C T AGACCATTTA ACAGTGATAG 17 040 

GGTTAAGACG AGTGAC AT C A CCCCAGATGA TTGGTGGTTT TACCCCACGC ATACCGTATG 17100 

ATTGTACCCA TCCATTTTTA GAGAAGAGGT ATCCTGACAA GTTTTGACCG AAGTACTCAA 17160 

CCATGTCATT ACGCTCAAAT TCACCGTGAA C AAGG AC AT C AAAGTCAATA TCTTCTTGCC 1722 0 

ACTTGATCCA TTCGTCAATC GTTTCAGCAA GGAAAGCGTC GTACTCTTTT TGAGACAATT 172 80 

CACCTTTACG GTAAGCCAAA CGTTTGGCAC GAACTTCTTT TGTTTGAGGG AATGAACCAA 1734 0 

TCGTTGTTGT TGGAAGAGCT GGAAGTTTGA AAGCTTCTTC TTGGATAGCT TCACGTTCTG 1740 0 

CAAAGGCTGG CAAACGAGTG TAGTCTGCGT CTGTCAAGCC AGCGATACGC GCACGAAGTT 174 60 

CAGCATTTTC ACCAACACGC TCAGTCGCAA AGAGTTCTTT GTTGGCTGCA AGAGCTTCTG 17520 

AACCTTGACC ATTTCGGATA GCATCCAAGT CACGGATTTC ATCCAATTTT TCAACTGCAA 17 580 

AGGCAAAGTG GTTCAAGAGT GCTGGTTCAA ATTCTTCATT AGCAGTTGTA AAT GGC AC AT 17 640 

GAAGAAGTGA GCAAGAGCTT GTCAAAACGA TGTTTTCAGC TGGAATTTGC TCAAGAACAG 177 00 

CCAAGCTCTT TTCGTAGTTG TTGCGCCAGA TGTTTTTACC ATTGACAATA C C T AC AT AG A 177 60 

GAGTCTTGTC AGCTGGGAAG CCACCTTTAA CGAGTTCAAG AGTTTTCTTA CCTTCAACAA 17 82 0 
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AGTCAAGACC GATAGCATCT ACTGGTAAGT TTACAAGGTC AGCGTATACG TCACGAACAT 17 880 

CACCGAAATA AGTTTGAAGC AAGACTTCAA GACCTTTTTT GTCAGCCAAG AGTTTGTTGT 17 94 0 

AAAGGTTCAA GAAGAGAGCT TTTTCTTCAG CTGTCAAGTC TTTTACAAGA GCCGCTTCAT 18000 

CCAATTGGAT GCGAGTCGCA CCAAGTTCAG CCAATTTAGC AAAAACTTCT TGGTAAGCAG 180 60 

CCACTAAGCT ATCTACGAAG TCGTCTGCTT TCACGCCTTC TTCAAAGTCT GACAATTGAA 1812 0 

GGAAAGTGAA GGGACCTACA AGAACAGGAC GAGTGTTCAA TCCAAGTTCT TTGGCTTCTT 18180 

GGAACTCATC GAAAATCTTG TG AC C AG C C A ATTTTACTTG AGTGTCTTTT TCAAATTTAG 182 40 

GAACGATGTA GTGGTAGTTA GTGTTGAACC ATTTCTTCAT TGGAAGGGCG CGAACGTCCC 183 00 

CTTTTTCTCC CTGGTAACCA CGTCCCAAAG CGAAGTAGCG CTCAAGGTCA GACAAGTCCA 183 60 

AGTTTTGAAC GGATGCAGGC ACCACGTTGA AAAGGAAAGC CGCATCTAGG AAGTT AT C AT 1842 0 

AGTGAGAAAA GT C ATT TG AT GGAATTTCAG TGATGCCTTT TTCTTTGACA ATGTTCCAGT 18480 

GTTTAGCACG CAAGTCTTTT GCTGCTGCTA AAAGTTCTTC TTCTGAGATT TCTTTTCTAA 18540 

AGTATTTTTC AGTTGTAAAT TTTAATTCAC GGAATTCGCC CAAACGAGGG AAACCGATGA 18600 

TTGTAGTTGA CATGATGTGT CCTCCAAAAT TTGTTGTTGA AACTATCTTA ACAGAAAAGA 18 6 60 

AAGCGTCTGT ATAATTGTAA AAAATTAGGG TTTGATATAG TTTGAAACTA TATATCTGTT 18720 

TCGGACAAAA G AAAAAG AC T TGAAGCAAAC GTCTCAAATC CTTTGTAATT CTTACTTTAC 18780 

AGCTATATTC CAATTAGAAT ACTAAAACAT GTTATTAGTA ATTCTTATAA GTGACTATGA 18 84 0 

CCTGTTATTA G AAAAGAC T A TAACTGATTC TAGTCAACTT TTTCCCTGTT CAAGTGGGAC 18900 

GATTGCTAGT GTCTTTCCTA AACTGGCTAG GACTTTTAAG ACTGTATCCA ACTGAGGACT 189 60 

AGTCTTTCCT GTCTCCATCC T AGCT AT G AC AGGCTGGCTT ATTCCACTGA CTTCTTCCAG 19 020 

CTTTTTCTGA CTGATTCCTT GTTCATACCT AGCCTCAATC AACTCGCTCA TGATAGCCAC 19080 

TCGCATATCA CTTTCAAGGA TTTCCTCCTT GCTAAAGAGC TCAGATGGAC ATCCTTCCAA 1914 0 

TTACTCCCAA TAGCACTATT CTT CATC ACT TAACCCTCTT TTTTTTACGT CTATGTATTT 19200 

TTAAAAAAAT GAGCGAATTA TGATTCGATA GATTGACCAG TGGGTTTAAA GTTGGTGCTA 192 60 

GCCTATTTCT TAAGCGATTT TCCTTTTCTA GGATAAAGCA GTTCCTGCTT GCTTAACCCC 19 32 0 

AATTTTCCAC GATGAATCCA ATAGTAAATG GTTGAAATTC CCACGTTAAC CCCTTTAGCC 19380 

ATCACCATCA TTTCAGGCGA AAATTTTTGG TTATGTTTTT GGTTATGTAT AGTGGAGAAT 1944 0 

CTTTTCCTTT AGTTTCTTAA GACTGTTGAG CGTAGTCGGC AGAATAAATC TCTTTGAAGC 19 500 

GCCCTTTTCC AAGACATTGT CGGACTGTCC CACGCTTGAT TTCAGTGTGG AT AGT T T GAG 19560 

GAGCTTTTCC AAGTAGAGAG GCAATTTCTC TATTTGATTT TCCTTCTTTT TTCCATCGTT 19 620 
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CGATTAAGCG ACGGCT AT CG ATTGTCAAAT GTTTGCCTTT TGTAGTATAA TTGTCTTGCA 19 680 

TTTCTGTGCC TTTTAATCAT TTCAATCTTA AATTGGACTT TTTTTACTTG GGTTGTACTT 1974 0 

AATCTATGAG GAAGACAAGA AAAAGAATAT CAATCAAGTA AAGTCACAAA GTCACATTAG 19800 

CTCCGAGCAA CCATTGCAAA TTGAGGTACT CACACAATGA TTAAAACATT TCTCTCTGCC 19860 

CTTTCGGTCA TTCTCTTTTC TATCCCTATC ATAACTTATT CTTTTTTCCC ATCTTCTAAT 19920 

CTTAACATTT GGCTATCTAC CCAACCTATC TTGGCACAGA TTTATGCCTT CCCCTTAGCT 19980 

ACTGCAACTA TGGCTGCTAT TTTAAGTTTC TTATTTTTTT TCCTATCTTT TTACAAGAAA 2 0040 

AATAAACAAA TACGGTTTTA CTCTGGCATT TTGCTCTTAC TATCGCTCAT ATTACTATTA 20100 

TTCGGAACAG ATAAAACCCT TTCTTCTGCA TCAAATAAGA CTAAAACCTT AAAATTAGTA 2 0160 

ACTTGGAACG TCGCTAATCA AATAGAAGCA CAACATATTG AGCGAATTTT TAGCCATTTT 2 022 0 

G AC G C CG AT A TGGCTATATT CCCTGAACTA GCTACCAATA TCAGAGGTGA GCAAGAAAAC 2 0280 

CAGAGAATCA AACTATTGTT TCATCAAGTT GGACTTTCTA TGGCCAACTA TGATATTTTC 2 0340 

ACTTCTCCAC C T AC C AAT AG TGGAATAGCT CCTGTGACTG TGATTGTCAA G AAAAGT T AT 2 04 00 

GGTTTCTATA CAGAAGCTAA AACTTTTCAT ACAACACGGT TCGGGACAAT TGTATTACAT 2 0460 

TCGAGAAAAC AAAAT AT AC C AGATATCATT GCCTTGCATA CTGCGCCTCC TCTGCCAGGT 2 0520 

TT AAT GG AAA TCTGGAAGCA AGACTTAAAC ATCATTCATA ATCAATTGGC TT C AAAAT AT 2 0580 

CCAAAGGCTA TTATTGCAGG TGATTTTAAT GCAACTATGC GTCATGGAGC AC TTGC AAAA 20 640 

ATAAGCTCTC ATAGGGACGC ATTAAATGCA CTGCCACCTT TTGAAAGAGG AACTTGGAAT 2 07 00 

AGCCAAAGTC CAAAACTTTT TAATGCAACA ATAGATCATA TTTTATTGCC TAAAAACCAC 2 0760 

T ACT AT GTT A AAGATTTAGA CATTGTAAGT TTTCAAAACT C TG AT C AT AG ATGTATTTTT 2 0 820 

ACAGAAATCA CATTTTAATT ATTTTATATA AAATCACCCC TCTAATGTTC ATAAACTAGA 2 0880 

GGGGGAATTT GTATCCTACT ATCGTTTAAC GCACTTCTGC ATTGACTTTT TCTTCGAGAG 2 0940 

ACGCTTGGAT TTTTTCCATA TAGCGTGCGA CTTCTTCGTC CGTTAAGCTG TCTTCTGGAT 21000 

TTTGGAAGGT CAAGCTATAA GCCATTGACT TCATACCAAG TCCCAGTTTT TCACCTGAGA 210 6 0 

AGACGTCAAA GAGTTTGATA TCTGTCAAAC GTTTCACGCC GGCAGCTTGG ATAGCATCTA 2112 0 

CAACTTCTTG GTGAGTCACT TCTGCCTTGA GGAGAAGGGC AACGTCACGG CTGACTGCTG 21180 

GGAATTTGGT GATTTCCACA AATGGAACAG CAGGTTGGAG CGCCCCTTCG ATGG CTG AAA 212 4 0 

GGTTAAGCTC AGCTACATAC GTTTCTGGAA TATCGTAAGC CTTGGCAGTG ACTGGATGCA 21300 

CTTGGCCAAG GAAACCAAGA ACTTGGTCAC CGAGTGAAAT CACGGCTGTA CGACCTGGAT 213 60 
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GAAGGCTAAC GATTTCAGAT GTTGCTGTAT AGGTTACTTG GAG T CC C AAA CGAGTAAATA 2142 0 

GGGCTTCAAG GATTCCCTTA GCATAGAAGA AATCAACTGG AACTGCTGCT GTTTGGAAAT 21480 

CTTTTTCAGC AACCAAGCCT GTCAAGGCAA AGGCAAAGCT GTTGATCTCA TTTGGAAGTT 21540 

CTTCTTTTGG ATTACCTGTT TGTTCAAAGA CTTTTCCAAT CTCATAAAGG GCCAAGTTTT 21600 

TATTCTTACG AGCCACGTTG TAGGCAACGG TATCAAGGAT CCCTGAAATC ATATTTTGAC 21660 

GGAGGACTGA ACGATCCACA GTCATTGGCC ACATGAGTTC AGTAAGGTTA CTTGGTTGAG 21720 

CTGTGAACTC AACTGCTTTT TCAGGAGTTG TC AG AG CAT A GGTGATGATT TCTGTCAAAC 217 8 0 

CTGCTCCTTC AGCAATGGTA CGAACTTGAC GGCGGAGTTT TTGTATCACA GTCAATTCAC 21840 

CAGCTGTACC ATCGTCTTTT GGAAGGCTGG TTGGCAAGCG GTCATATCCA TAGATACGAG 21900 

CG ATT T C TT C AAAG AG AT C A GCTTCGATTG TGATATCCCA ACGACGACGT GGTACGCTGA 21960 

CTGTAAAGCT GTCTGCATTT C C AG AAAG AC CAAAGCCAAG ACGACGGAAG ACGTCTTCTA 2 2 020 

CATCAGCATA AGACAGCTCA GTTCCGAGGA CACGGTTAAC ATCAGCAAGG GTTGAAGAAA 2 2080 

CTTCCACATC AG AGGT AT C A AGCTCACCCG CTGAAACGAT ACCCTTACGC ACCGTCGCGC 22140 

CTGCAAGCTC TGCAATCATG CTAGCTGCCG CATCAAGGGC TTCATTAACT GTTGCCACAT 22200 

TAATTCCTTT TTCAAAGCGA GAAGATGACT CAGAACGAAG GTTCAGGCGA CCACTTGTCT 2 22 60 

TACGGATAGA TTTGCCATTA AAAACAGCAG CTTCAAGGAT AACACGACTA GATTTTTCAG 22320 

AAATTTCTGT AGCCTGACCA CCCATAACAC CGGCAAGGGC TACTGGTTTG TCAGCAACTG 223 80 

TAATCACGAG GTCTGTCTCA GCCAAGTCTC GTTCTTCACC GTCCAGGGTC ACTAATTTTT 2 2 440 

CACCATCACG CGCTTCACGC ACACGGATGT CAGTCCCTTC AAATGTGTCC AAGTCAAAAG 22 500 

CATGCATAGG TTGACCAAAG TAG AG CAGG A TGTAGTTTGT CACGTCTACA ACGTTATTGA 22560 

TGGGACGGAT GCCTTCGTTC ATGAGAAGGT TTTGCAACCA TTGTGGACTT GGTGCGATAG 2 2 620 

TCACATTGTC CAAGATACGA GCTGCATAGT AAGGCGCCTT GTCTGTCTCA ATGCTGACAG 2 2 680 

AAAGGGCATC TGCCGCAGCT TCATTAGTTT CTGTTAGAGT AAATTTTTTA AAGTTGACTG 2 274 0 

CCTTGTCATA GATGGCTGCC ACTTCGTGAG CCACTCCACA CATAGAAAGG GCATCTGCAC 2 2800 

GGTTTGGTGT GATGGAAAGT TCGATGATTT CATCATCCAA GT C T AGGT AA G AAAAG ACT T 22860 

CCTCACCTGG CACGGCATCT TCAGGCAAGA TTTGGATGCC ATCTGCGAAT TCCTTAGGCA 22 920 

CAACTGAGTC AG AAATTC C C AATTCACCAA GTGAACAGAT CATTCCAAGT G AC TCCAAAC 22 980 

CACGGATTTT TCCTTTTTTG ATTTTGTAGT TATCAGCGAT ACGAGCTCCT GGAAGAGCCA 2 3 040 

CCATGACCTT GAT C CC AG C A CGCACATTTG GGGCACCACA AACGATCTGA CGCTCTTCTT 23100 

CTTCGCCAAC GTTAATCTGA CAAACATGGA GGTGAGTCTC TGGCACATCT TCGCAAGACA 2 3160 
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AGACCTCACC G AC G AC AATT T TT G AG AG AC CAGCAGCTGG TGATTCGACA CCCTCTACCT 2 3220 

CGATCCCTGT AGTTGACATT TTTTCAGCCA ACTCTTGTGA TGGCACATCA ATGTCCACCA 23 280 

ATTCTTTTAA CCATTTATAA GATACAAGCA TAATTTAGTT CTCCAGAATG ACAGTTGTCA 2 3340 

CTCTAGTTCT TTTCCTTTCC TATCATTTCA ATAGAAGAAT CCTCTTCTTA CCTTAATTTC 2 3 400 

TTTCTCAGTA ACCAATCCGT ATCTACTTTT TGACCAACCA TAAAATGATG TTGGCTAAAT 2 34 60 

TTTTCAAAAC CATATCGGTT ATAAAACGCT TGAGCTTTTG TATTATGCTC CCAAACACCT 2 3 520 

AGCCAAGCCC AAGAAAAACT ATTTTTTGTA GCAAGTTCAA GTGCGAATTC AAACAGTTGC 2 3580 

TTACCTAGTC CAAATCCTTG GAATTTTTGT AG C AC AT AG A GACGTTGAAT TTCAAAAGCG 2 3 640 

TCCTCTAATT CTCTCTCAGT TTGAGCACTT CCCCAGTTGA CTT T GAG AAA AC C AG CT AT C 2 37 00 

TCCTCCTCAT GCATAATGAA AT AG GTTT C A GAGTCAGGAT TTCCCAACTC AGT TG AC AAA 2 37 60 

GTTTTCAGAC TATAAGCCTC TTCAAAGTAT TCCTGTAACT GCTCTTCCGT ATT AT CAT AC 2 3 820 

GCAAAGGTTT CACGAAAGGT TTGTTTGGCA ATTTTAGCCA ACACCTCAAC ATCTGCCATT 2 3 880 

TCTACTTTTC TAATCATTAT TTAAACTGTT CTGAGAAGCG GACATCTCCT TGGTAGAATC 2 3 940 

CACGGATATC GTTGATTCCA TAACGGAGCA TAG C T AC ACG CTCTTGTCCA AGACCAAAGG 2 4000 

CAAAGCCAGA GTATACAGTC GC AT CG AT AC CACTCATTTC AAGGACACGT GGGTGAACCA 2 4060 

TACCGGCCCC CATAATTTCG ATCCAACCTG TTTTCTTACA TACATTACAG CCTTCTCCAC 2412 0 

CACACTTGAA GCAAGAAACA TCCACCTCAA CAGATGGCTC TGTGAATGGG AAGTAAGATG 2418 0 

G AC GC AAACG AATTTGACGC TCTTCACCAA ACATTTTTTG GACAATCAAC TGAAGCGTTC 24240 

CTTGAAGATC AGCCATAGAG ATATTTTTCC CAACTACCAA GCCTTCGATT TGGTGGAATT 2 4300 

GGTG AC TGTG GGTCGCATCG TCCGTATCGC GACGGAAGAC ACGCCCTGGC GAG AT CAT C T 2 43 60 

TCAAAGGACC TTTAGAAAAA T C ATGGGC AT CCATAGCACG CGCCTGAACT GGAGACGTGT 2 4420 

GGGTACGGAG CAAGATTTCT T C AGTG AT AT AGAAAGTATC CTGCATATCA CGAGCTGGGT 24480 

GGTCTTTTGG AAGGTT C AT A CGTTCAAAGT TATAGTAGTC TTGCTCCACT T C AAAAC CAT 24540 

C C ACG ACT TG ATAACCCATA CCGATGAAGA TATCTTCGAT TTCTTCACTG GTTTGTGTCA 24 600 

AAACGTGACG GTGACCAGTC GCAACTGGAC GACCTGGAAG CGTCACATCT ATACTCTCGC 24660 

TAGCCAGTTG AGCCGCGACT TTCTTTTCTT CCAAGAGCTT AGCTGTTTCT TCAAAAGCAG 2 4720 

CAGTCAAGAC AT C ACGAGC T TCATTGACGT GTTTCCCGAT GATTGGACGC ATCTCAGCAG 2 47 80 

AAACATCTTT CATCCCTTTG AGGATTTCAG TGAGCGAACC CTTTTTACCA AGGACAGAGA 24840 

CACGCAAATC TTGCATCTCT TTTTCATTTC CAGCAGTAAT CTGCTTCAAG CTAGCCAGCG 24900 
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TTTCTTCGCG AAGCGCTTTT AATTGTTCTT CAATAGTTGA CATATTTCCT CCATCAGTCT 24960 

CTCGTAGATA AAAAGAAAAC CACATGCCAA AAACTCCACT CGGAGCGTTG ACACGCGGTA 2 502 0 

CCATCCGTTT TCATCTGACA AGTCAGACCT TCATTTCTAA ATCCATGCGC AAGTGAATTC 2 5080 

ACCCAGCTTT CATATAGAGA GCTTGCAGTC ACGGCTCTCC TCCCTGATAT ACTTCCCTTG 2 5140 

GGCTACTAGT CTTTCAGATT CCTATTCAAT TACTACTTAG TTTATCAGAT TTTTACCATT 2 5200 

CTTGCAAGAC CTATCTTACT TCTGCTTGTT AGCTTATTCT TATCTAAATT TATATAAACc 2 5260 

TTATCTAAAT TAACTATTTA TAATTTTTGT AACAAAATTA AATTAATTGA CACTCCCCTA 2 5320 

TAAAATAAAG AAGTTTAGAA TTTAATGTCT TCCAAACTTC TTTATTCCAT ATTTAATGAA 2 53 80 

ATGCCACCTT AACCGTGATA ATAGCTAGTC ATCAATAAAA AACTATTTGA ATAAGGATTC 2 5440 

TCCATTTGAT TCAATCACTT CTTTATACCA AGTAAAAGAC ATTTTCTTAT ATCGATTTAA 2 5500 

TGTACCACTT CCATCATCGT TTCGATCAAC ATAAATGAGA CCGTACCTTT TAGAAAGTTG 2 5560 

TGCAGTGGAC ATAGAAACAC AGTCAATACA TCCCCAAGAC GTATAGCCCA TAATTTCAAC 2 5 620 

ACCATCCTGT AGAGCTTCAG CAACTTGCAA TAAATGTTCT TTCATATACT GAATTCTATA 2 5680 

ATCATCTTGG ACGGTTAAGT TATTAAGTTC ATCTTTTATT AGTTGATCTT TAGCACCTAA 2 5740 

TCCATTTTCT ACTATAAATA ATGGGATTTG ATAACGGTCA TAATATCTAT TTAAAATTAT 2 5800 

ACGTAGTCCA ATTGGATCAA TTTGCCATCC CCACTCTGAA GACTCTAAAT AAGGATTTAC 2 58 60 

TAAACCACCA ATAATATTCC CTTCTCCTGA ATTATACTGT GTTGGAAGAG CAGATTGAGT 2 5920 

CACACTCATG TAATAGCTAA AGGATAAAAA ATCTACGGTA TAATTTTTTA ATAACTCTGC 2 5980 

ATCTTCAGCT GCAAACTCTA TGTTAATGTC ATTTTCCTTA AAATATCTTT TTGCATAATT 2 6 040 

CGGATAATAA CCTCTAACAT GCACATCTGA AAATAGATAA TTTAGATTCT CATACTCATG 2 6100 

AGTCGCCCAT ACATCTTTTG GATTTGGAGT CATTGGATAA GCTGGCATAG CTAATACCAT 2 616 0 

ACATCCCACC TTAAACTCTG AATTAATCTC ACGAGCAATT TTTGTAACCA AACTTGAGGC 2 622 0 

GACTAATTCA TGATGTATAG CTTGATATAA TTCTTGTTTC GAAAGATTCT CCTTAGGTAT 2 6280 

ATCTATTCCT CCACTAGTAA ATGGTAATTC CAAAACAGAG TTTACTTCGT TAAATGTAAG 2 6340 

CCAATATTTA ACTTTATCTT TATACCTTTC TAAAACTGTT CGAGCAAATT TTTCATAAAA 2 6400 

ATGAAT CAT T CTCCTATCAA CCCATCCATG ATATTTTCTT GCTAAATATA ATGGAGTCTC 2 6460 

AT AGTGT GAA AGAGTTACAA GTGGTTCTAT CCCGTGAGCA TGTAGTTCAT CAAACAATTC 2 652 0 

AT C AT AAT AT TTCAACCCAG CTTCGTTAGG TTCTTCCTCA TCTCCTTTTG GAAAAATTCT 2 6580 

ACTCCATGCA ATAGAAGTAC GAAAAACATT AAAGCCCATT T C AG AAAAC A AGGATAT AT C 2 6640 

TT C C TT AT AT TTATGATAAA AATCAATACC TATCAATTTT AAGTTATCTT CTGTAGGATT 2 6700 
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TTCTGTTGCT TCTCCTAATC CACCTTTGGG TAACACATCC TGAACTGATA AGCCCTTACC 2 6760 

AT CT T C ATT A TATGCTCCCT CTACTTGATT AGCTGCAACA GCTCCACCCC AAAGAAAATC 2 6820 

AT CTGG AAAA ATGGTCATAA CTTTCCTCCA TTATAATATT ACCAGTAATT CCTTAGAATG 2 6880 

CTCGATTGTC TGATTATTAG GTAATACTAA TACATCTAGA AAATCATTGG TATTCGTTAC 2 6 940 

AATTACTGGT GTAACTGTTT CGTAGCCTTT AGTCTTGATT AAATTCAAGT CCATTTCAAA 2 7 000 

AATCAACTGA TTTTTGAAAA CTCTGTCTCC TTCTTCTACA TGACTAATAA AACCTTGACC 2 7060 

TTTTAGCTCA ACAGTATCTA ATCCAATATG AATTAGTAAC TCAACACCCT CATCACTCTT 2 712 0 

CAATCCAATT GCGTGCTTAG TCGGAAAAAT ATTTGTAATT TTCCCATCAA ATGGTGCATA 27180 

AACCTTACCT TCACTTGGGA TAATCGCTAC TCCGTCTCCA ATTAGTTTAT CTGAAAATGT 27240 

TTTATCCTGG ACATCGCTTA ACGGAATGAT TTCTCCTGAT ATAGGAGAAA ATATCATTTT 2 73 00 

TTTATTTGAA ACTCCAGCTT CAACTTCTAA ATTGCTAGAA CTCTCTTCTT CAT CG AT TC C 2 7360 

AAATATATAA GCTAATACAA AGGTAATAAC AACCGAAATG ACCGCCACAA TTAAAGCATT 2 742 0 

TACAATATTT GATGGCACAT CAGAATAAAT AAATTGAGGC AACGCTATCA AAGATGGGAC 2 74 80 

AGCAAATAGA TATGCTTTAA CACTAGTAAG ACCTGCAAAT AATCCCGCTA ATCCACCACC 27 540 

AATCATAGCT GCATAAAGCG GTTTTTTATA TTTTAAAGTC ACACCATATA ATGCAGGTTC 27 600 

GGTAATCCCT GCAAGTAAGG CTGAGAAACC TGCTGCAAAA GCAATTTGTT TTGTATTATT 27 660 

ATTTTTACTC TTTAATGCAA C AG C C AT CG A AGCAGCCCCT TGAGCTAAGT TTGACCCTAA 2772 0 

CATTGCTGGA AGAATTAATA CGTCTGGAGT AGCAATAGAT GCCGCCAAAA AAATAGGTGC 2 77 80 

AAAAGCCCAA TGCATTCCAG TCATAACAAT AAATGGCATA ATAGCACCAA GAATAGCTAA 2 7 840 

TGTAAGCCAT CCAGCTACAC CATACATTTG CCCAACTAGA TTTGATAATC CTTCACCAAC 27900 

AATTACTCCA ATAGGTCCGA CTACAACTAA GGCAATACAG CTTGATACTA ATAATACTAG 27960 

CGTAGGTTGC AAAAAACTCT TAGTAATAGC TAGTGTTAAT TTAGCAATTA TTTTTTCAAT 2 802 0 

AT AT TT CAT C AACCAAACCA TAATAAGAAT TGGAACGACT GATGAACCAT AACTAGCTGG 2 808 0 

TGTCACAGGT GCACCAAATA AACTAAGAGG ATTCCCTGAT TGCACCATTT GAACAAAATT 2 814 0 

TGGATGGAGA AGTACACCTG C T AC AGAC AT AGCTAATGTA GATGTTACTT TTAATTTTTG 2 8200 

TGATGCAGAA TAAGCTAATA ACAGCGGTAA GAAATAATAT GGAGCATCCC CAAAAAATGT 2 8260 

CAAAAAAGCA ATAGTCTGAG AATCTGATTG CAATATACCA AGCATTGGTA AAATGATTAC 28320 

CAAGACTTTC AACATACCTC CCCCTAACAT TGCTGGAATG ATTGGAGTCA TGGAACCAGC 2 83 80 

G AT AT ACT C A ATGATTCTTT CT AAAAT AT T CCCTTTGTGC CCTTGAACAA CTGAATCGGA 2 8440 
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TTCAAAATTG CCAAGTTTAA CGAATTCTTT ATAATAATTA GCTACATCAT TACCAAGTAT 2 850 0 

AATT T GAT AT TGTCCATTCT TTTTCATAAT ACCTATTACA CCTGGTATCT T C TT C AC AT C 285 60 

ATCATCATTG AC T AAATT TT CATCTTTTAA TTCTAATCTT AAACGTGTTA CACAATGGGT 2 8 620 

AACTCTATTG ACATTTTTTT CACCTCCAAT TACATCGAGG ATTTTTTGTA CCGTATCTTT 2 8 680 

ATAACTCATG GTATTCTCCT ATTCTATTAA TCTAAATTTT TTGTTAAGCG ACGAATATGA 2 8740 

GCCATCAAAT AAACTAATTC ACTAGAAGTC AGCAAATAAT TGTACTCCGT TTGTATAAAC 2 8 800 

ATTGCTACCT GTTCACCACA TTCATATTCT CTAGGATATT TATTTTTCAT TAATGCTAAC 28 860 

AAGTCTTCAT CATCATCGTC GG 2 8 882 
(2) INFORMATION FOR SEQ ID NO : 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 835 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

GCCTATGTCT TTTTCAAAAA AATGCTTGAC TTGAGACGGG AACTAGGGAA GTCTAAAGGC 60 

GGAAGGCATT GATTTATACT CTTCGAAAAT CTCTTCAAAC CACGTCAACG TCGCCTTGGA 12 0 

TTATATATGT AACTGACTTC GTCGATGCTT ATCTACAACC TCAAAGCAGT GCTTTGAGCA 18 0 

ACTTGCGGCT AGTTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATTATA TTACTTTCTA 24 0 

TTTGTAGGAG GTGGCTTATG AAGATTCCTC TCTTAACTTT TGCAAGGCAT AAATTTGTTT 300 

ATGTCTTGCT TACTTTGCTT TTTCTTGCTT TGGTTTATCG TGATGTTTTG AT G AC T TAT T 3 60 

TCTTTTTTGA TATTCATGCG CCCGATCTAG CTAAATTCGA TGGACAAGCA AT T AAAAATG 42 0 

AC T TATT AAA AT C AGC AT T A GATTTTCGTA TTCTCCAGTT CAATCTAGGT TTTTATCAAT 4 80 

CATTTATTAT TCCAATCATC ATTGTTTTGC TAGGTTTTCA ATATATTGAG CTGAAAAATA 54 0 

AAGTTTTACG ATTGAGTATT GGAAGAGAAG TGAGTTATCA AGGGTTAAAA AGAAAGTTGA 6 00 

CTTTGCAAGT TGCAAGTATC CCTTGTTTGA TATATTTAGT GACTGTGCTG ATAATTGCAA 660 

TTATAACCTA TTTCTTTGGG ACTTTTTCTC CTCTTGGATG GAATTCTCTA TTTTCTGATG 72 0 

GAAGTGGTTT ACAAAGACTC CTAGATGGAG AGATAAAAAG CTATTTGTTC TTTACTTGTG 780 

TCCTACTAAT CGGTATTTTC ATCAATGCAA TCTATTTTTT ACAAATAGTT GATTATGTGG 840 

GGAATGTGAC TCGTTCGGCA ATCACCTATT TGATGTTTCT TTGGCTTGGT TCTATGCTGC 900 

TTTATAGTGC CTTGCCTTAC TATATGGTTC CTATGACGAG TTTGATGCAA GCTAGCTATG 960 
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GGGATGTAAG TTTGATGAAA CTCTTTACTC CTTATATCCT TTATATTGTC CCTTACATGG 1020 

TGCTTGAAAA ATATGAAGAT AATGTTTAAG AATTTTAACA ATATTTTGCT AAATAGAAAG 1080 

ATTGTTTTAC TACTTCGTAT AGTTCTGATG ATGATTTTGA TAAACCATCT ATTGTCAACA 114 0 

GCGGTTCAAA AGCAGGATGC TGTTATCTTT TTCAAGAGAG AATTGATTTC AATTTTTTCC 12 00 

TATAATGACT ATTCTGAAGC GAATTTAGAA ATCCCCAAAC TATTGTTAAA CCTTTCGCTT 12 60 

TTCATGGTAG GATGGCTCTC TGTCATTTTA CTTGAAAGTG ATTTGGCAGA CCATTACCAT 132 0 

CACTTGATTC GCTATCAATC AAGCTCCTTT TTCGATTATA CAAGGAAACG ATTGGTTGTC 1380 

ATTTCTAAAT TTTTTACTCA AGATTTGTTT GTCTGGTTTC TTGGTTTACT TCCTCTAGGA 144 0 

ATTCATTTCA AAACAGTCGC ACTTTTCTTT TTACTTGCTC AGTTAATGAT GTTGTACTTA 1500 

CTACTGTCTT ATCTGATAGC ACTGATTAGT GCGGGCGCTG GTTTTTCCTT TTTTCTCTAT 15 60 

TTTTTAGCAT TTGTGGGACA AGAATGGATG ATGGATCATA TTGTAACAGT GTATTTAGTA 162 0 

CTCTTAAGTT TATTAGTTAT GTTGATTGTT AGTCGCTTGG AAGAGAAATT TAAGAAAGGA 1680 

TAAACG AT G A GACTTGAAAT TATAAATGGA CAGAAAATTT ATGGGAAAAG ACCTATTTTA 174 0 

AATCAGTTGA ATTTGGTGTT TCAATCAGGA AAAATTTATG GACTTAAAGG T G AT AATGG A 1800 

TCTGGCAAGA CGGTTCTTTT AAAGATACTT GCTGGTTATA TTAAGCTTGA CAAAGGAAAA 186 0 

GTTCTTCAAG ATGGTAAAGT TTACGGGGTA AAAAATCATT ATATTCAGGA TGCAGGAATT 192 0 

TTAATTGAAA AAGTCGAGTT TTTATCTCAT TTATCCCTGA GAGAAAATTT GGAACTGTTA 1980 

AGGTATTTTT CATCTAAAGT TACGGAAAAA AGAATTGCCT ATTGGATTCA ATACTATGAT 2 04 0 

TTACAGGAAT TTGAAGACAT TGAATACCGT CATTTATCCT TAGGAACAAA GCAAAAAATG 2100 

GCCTTGATTC AAGCCTTTAT TTCCTCTCCT TCTATACTCT TTCTCGATGA ACCTATGAAT 2160 

GCTTTGGATG AGAAGAGTGT GAGGTTAACC AAACAGGTCA TTTTATCTTA CCTGAAAAAA 2 220 

GAAAATGGTC TGGTTATCCT GACGTCGCAC ATATCGGAAG ATATTTCAGA CCTTTGTACA 2 2 80 

GATGTATTAG TTGTCGAAAA T GGAC AT AT A CAAATGTAAA GGATATACAA TCCTAGGAGA 2 3 40 

TGGCTTATGG CACATCTAAA ATCATTTATT ACACGATATT CCAAGGTTTA TATTGGTTTA 2400 

GTTCTGCTGA TCTGGCTGTC TTTCTTCTTT ATCCCTTGGG ATAAACCACT TCTGGGGATA 2 4 60 

AGGATTGACA TCTTCATCAT AC AG AAAAT C TTGCTAGCTT TTGGAATTCT GTCCATTCTC 252 0 

ATGGCCTTGC TGTCCAAGAA AGTCAGTCTC TTTGTTTTTG GACTGATTTG CTGTCTTTCT 2 580 

CTTTGGATTA ACTTATTTAT CACATTTGCC ATTTTGCCGA TTTTTGGCAA TTAAACAGTC 2 64 0 

ATAAAAGTCG GAGAGGT T AG CTTGAAAACT AACCTCTTTT TCCTTTTCAA AATGGGGATT 2 700 
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CTTCCTTGAA 


AATAATCAGT 


AATTGTGCTA 


AAATTAAAGG 


AACATTCTAA 


AATATTCGGA 


2760 


AT T T AAAGT A 


AGGAAAAACA 


TGGCTAATAT 


TTTAAAAACA 


ATTATCGAAA 


ATGATAAAGG 


2820 


AGAAATCCGT 


CGTCTGGAAA 


AG ATGGC T G A 


CAAGGTTTTC 


AAATACGAAG 


ACCAAATGGC 


2880 


TGCTTTGACT 


G AC G AC C AAC 


TAAAAGCAAA 


AACAGTTGAA 


TTTAAGGAAC 


GTTATCAAAA 


2940 


TGGAGAATCA 


CTGGATTCAT 


TGCTTTACGA 


AGCATTTGCG 


GTTGTCCGTG 


AAGGTGCCAA 


3000 


ACGTGTCCTA 


GGTCTCTTCC 


CTTATAAGGT 


TCAGGTCATG 


GGGGGGATTG 


TTCTTCACCA 


3060 


TGGTGACGTG 


CCAGAGATGC 


GTACAGGGGA 


AGGGAAAACC 


TTGACTGCGA 


CCATGCCGGT 


3120 


ATACCTCAAT 


GCCCTTTCAG 


GTAAAGGGGT 


TCACGTAGTT 


ACGGTTAATG 


AATACCTGTC 


3180 


AG AAC GTG AC 


GCGACTGAGA 


TGGGTGAATT 


GTACTCTTGG 


CTTGGTTTGT 


CAGTAGGGAT 


3240 


TAACTTGGCT 


ACCAAATCTC 


CAATGGAGAA 


AAAAGAAGCC 


TATGAGTGTG 


ATATTACTTA 


3300 


CTCAACTAAC 


TCAGAAATCG 


GATTTGACTA 


CCTTCGTGAC 


AACATGGTCG 


TTCGCGCCGA 


3360 


AAACATGGTA 


CAACGTCCGC 


TTAACTATGC 


CTTGGTCGAT 


GAGGTTGACT 


CTATCTTGAT 


3420 


TGACGAGGCT 


CGTACACCTT 


TGATTGTATC 


AGGTGCCAAT 


GCGGTTGAAA 


CCAGTCAGTT 


3480 


GTATCACATG 


G C AG AC C ACT 


ATGTAAAATC 


TTTGAACAAA 


GATGACTACA 


TCATCGATGT 


3540 


GCAGTCTAAG 


ACT AT TGGT T 


TGTCTGATTC 


AGGGATTGAC 


AGGGCTGAAA 


GCTACTTCAA 


3600 


ACTTGAAAAC 


CTCTATGACA 


TCGAAAACGT 


GGCTTTGACT 


CACTTTATCG 


ATAACGCCCT 


3660 


TCGTGCCAAC 


TACATCATGC 


TTCTCGATAT 


TGACTATGTG 


GTGAGCGAAG 


AGCAAGAAAT 


3720 


CTTGATTGTC 


GACCAATTTA 


CAGGTCGTAC 


CATGGAAGGT 


CGTCGTTATT 


CTG AT GG ATT 


3780 


GCACCAAGCT 


ATTGAAGCCA 


AAGAAGGTGT 


GCCAATCCAG 


GATGAAACCA 


AGACATCTGC 


3840 


CTCAATCACG 


TACCAAAACC 


TCTTCCGTAT 


GTACAAGAAA 


TTGTCTGGTA 


TGACGGGTAC 


3900 


AGGTAAGACT 


GAGGAAGAAG 


AATTCCGTGA 


AATCTACAAC 


ATTCGTGTTA 


TTCCAATCCC 


3960 


AACAAACCGT 


CCTGTTCAAC 


GTATTGACCA 


CTCAGACCTT 


CTTTATGCAA 


GTATCGAATC 


4020 


T AAGTT T AAA 

j. n^ivj x J. j. ruin 


VJV_VTVJJ ± X \J X V^VJ 




X ^-Vjj x x t\\^ 


f -1 A A A A (~* /~" fTP i" 1 




A r\ a r\ 

4 080 


GGTTGGTACA 


GTAGCGGTTG 


AAACTAGTGA 


C T AC AT T T CT 


AAGAAATTGG 


TTGCAGCTGG 


4140 


TGTTCCTCAC 


GAAGTCTTGA 


ATGCCAAAAA 


CCACTATAGA 


G AAGC C C AAA 


TCATCATGAA 


4200 


TGCTGGTCAA 


CGTGGTGCCG 


TTACCATCGC 


AACCAACATG 


GCGGGTCGTG 


GTACCGACAT 


4260 


CAAGCTTGGT 


GAAGGTGTTC 


GTGAACTTGG 


AGGACTTTGT 


GTTATTGGTA 


CAGAACGTCA 


4320 


TGAAAGTCGT 


CGTATCGATA 


ACCAGCTTCG 


TGGACGTTCA 


GGTCGTCAAG 


GAGATCCAGG 


4380 


TGAGTCACAA 


TTCTACCTAT 


CTCTTGAAGA 


TGATTTGATG 


AAACGTTTTG 


GTTCTGAACG 


4440 


CTTGAAGGGA 


ATCTTTGAAC 


GCTTGAACAT 


GTCTGAAGAG 


GCCATTGAGT 


CTCGCATGTT 


4500 
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GACGCGTCAG GTTGAAGCAG CTCAGAAACG TGTCGAAGGA AATAACTACG ATACCCGTAA 45 60 

ACAAGTCCTT CAATACGATG ATGTCATGCG TGAACAACGT GAGATTATCT ATGCTCAACG 4 62 0 

TTACGATGTC ATCACTGCAG ATCGTGACTT GGCACCTGAA ATTCAGTCTA TGATCAAACG 4680 

C AC G ATT G AA CGTGTCGTTG ATGGTCATGC GCGTGCCAAA CAAGATGAAA AACTAGAGGC 4 74 0 

AATTTTGAAC TTTGCTAAGT ACAACTTGCT TCCTGAAGAT TCTATTACGA TGGAAGACTT 48 00 

GTCAGGCTTG TCTGATAAGG CCATCAAGGA AGAGCTTTTC CAACGTTCCT TGAAGGTTTA 4860 

CGATAGTCAG GTTTCAAAAC TACGCGATGA AGAAGCAGTT AAAGAATTCC AAAAAGTTTT 4920 

GATTCTACGA GTGGTGGATA ACAAGTGGAC AG AT CAT AT C GATGCCCTTG ATCAATTGCG 4 980 

TAACGCGGTT GGACTTCGTG GCTATGCTCA GAACAACCCT GTTGTTGAGT ATCAGGCAGA 504 0 

AGGTTTCCGT ATGTTTAATG ATATGATTGG TTCGATTGAG TTTGATGTGA C AC GCTT G AT 5100 

GATGAAAGCA CAAATTCATG AACAAGAAAG ACCACAGGCA GAACGTCATA TCAGTACAAC 5160 

AGCGACTCGC AATATCGCTG CTCACCAAGC AAGTATGCCA GAAGATTTGG ATTTGAGCCA 52 2 0 

GATTGGACGC AATGAACTTT GCCCATGTGG TTCTGGTAAG AAATTTAAAA ACTGTCACGG 52 80 

TAAAAGACAA TAAAATGAGA TAGTTTAGAG GCGGATATCT TGTGAAAAGT AAATTTTTAC 534 0 

TGGGTATCCG TTTGCTTTAT AAGGAGATGA GTTATGGTAT TTACAGCAAA AAGCTCTAAA 5400 

ATAAATATAG AAGAAGTTCG TGCCTTGTCA AAATTAGAAG GTCAGGCTTT GGAGAGGAAA 54 60 

TCACAGCGAG ATCAAGAGCT AG AAGC C AT T ATACGTGGAG AAGACCAGCG AATTCTCTTG 552 0 

GTAATCGGGC CATGCTCATC TGACAACGAA GAAGCTGTCC TTGAATACGC TAAGCGTTTG 558 0 

GCAGTCCTAC AAGAAGAAGT GGCAGATCGT ATCTTTATGG TTATGCGTGT TTATACTGCC 5 64 0 

AAACCCCGTA CCAACGGAGA TGGCTATAAG GGCTTGATTC ACCAGCCTAA CGCGACAGAA 57 00 

GCGCCTAGTC TTATCAATGG AATCAAAGCC GTTCGCCATC TTCACTATCG TGTCATCACA 57 60 

GAAACAGGGA TGACAACTGC TGATGAAATG CTTTATCCTG AAAACCTTCC GCTTGTACAT 5 820 

GATTTGATTT CTTACATGGC AGTTGGTGCC CGTTCAGTTG AAGACCAGCA ACACCGCTTT 5880 

GTGGCAAGTG GGGCAGGATT TTCTACTGGT TTTAAAAATC CAACCTCTGG AAATCTCAAT 5 94 0 

GTCATGTTTA ATGGG AT TT A TGCTGCTCAA AACAAACAAA GTTTCCTTTT CTTAGGAAAA 6000 

GAAGTAGAAA CAACTGGGAA CCCGCTTTCA CACGCTATTC TTCGTGGTGC TCTTAATGAG 6060 

TATGGAAAAA AT ATT C C C AA CTACTATTAT GACAATTTAA TTG AT AC CAT TGCCCAGTAT 612 0 

GAGAAAATGG GCTTGGAAAA TCCTTTTATC ATCATTGATA CCAATCATGA CAATTCTGGT 6180 

AAGCAGTATA TTGAACAGAT CCGAATTGTC CGCCAGACCT TGATTAACCG TGCTTGGAAT 6240 
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GAAAAAATTA AGCAGTTCGT TCGTGGTTTT ATGATTGAGT CTTATCTGGA AGATGGTCGA 63 0 0 

CAAAATGAGC CAGAAGTATT TGGTAAGTCT ATCACAGACC CTTGCCTGGG TTGGGATAAC 63 60 

ACAGAAGCTC T T GT C AG AG A AAT T T AC AAA ACGTTAGGAG AATAAGATGG CATTTATTGA 6420 

AAAAGGTCAA GAAATCGATA TGGAAGTCAT CAAGGCTGAA ACCCAATTGT CTGCGGAAGC 64 80 

CTTGAGACTC AAGGAAAGCC GTGACAGGGA ATTGGCAGAT ATTATTTCAG GGGAAGATGA 6540 

CCGTATTCTC TTGGTGATTG GTCCTTGCTC TTCTGATAAT GAAGAGGCGG TCTTGGAATA 6600 

TGCTCGCCGT TTATCTGCCT TGCAAAAGAA GGTAGCGGAT AAGATTTTCA TGGTCATGCG 6 660 

CGTGTATACT GCTAAGCCTC GTACCAATGG AG ACGG C T AT AAAGGATTAG TTCACCAGCC 672 0 

AG AT ACTT CT AAGGCTCCAA GCCTGATTAA TGGCTTGCAG GCTGTGCGCC AGTTGCACTA 67 80 

CCGCGTGATT AC AG AG AC TG GTTTGACAAC GGCAGATGAG ATGCTTTATC CGTCAAATCT 6840 

GATCTTGGTG GATGACTTGG TCAGCTACCA TGCCGTTGGA GCTCGTTCTG TGGAAGACCA 6 9 00 

AGAGCACCGC TTTGTGGCTT CTGGGATTGA TGCACCAGTA GGGATGAAAA ATCCAACCTC 6 9 60 

AGGAAATTTG GGTGTTATGT TTAACGCCAT CTATGCTGCT CAAAACAAGC AAACCTTCCT 7 02 0 

TTATCATGGG CAGGAAGTTG AGACATCAGG TAATCCTTTG GCCCATGTTA TCCTCCGTGG 7 080 

AGCAGTCAAC GAGTATGGCA ATTATATGCC GAATTACTAC TATGAAAATC TACTCCAAGC 714 0 

CATTGAACGC TATGAAACCA TGGGACTTGA AAATCCTTTT ATCCTCATTG ACACCAACCA 72 00 

TGATAACTCA GGCAAGCAAT ATATGGAGCA GATTCGAATT GTTCGCCAGA CCTTGCAGAA 72 60 

TCGTGATTGG AATGAGAAAA TTAAAAAGAC GGTTCGAGGA TTTATGATTG AATCTTACCT 73 2 0 

AGCAGATGGT CGTCAAAACC AACCAGAGAT CTTTGGTTGC TCTATTACTG ACCCTTGCCT 73 8 0 

AGGTTGGGAA AATACAGAGG CCTTGGTAGA AGAGATTTAT GTTACCTTGA CAAAATAAGT 744 0 

GAAAAGGATG GAGTTGGGGA ATCTCAACTC CTTTTGATGA GAATGATAGT TGGACACGGA 7 5 00 

ATTG AC AT CG AAGAATTGGC TTCGATAGAA AGCGCAGTTA CACGACATGA AGGATTTGCT 7 560 

AAGCGTGTAC TGACCGCTCA GGAAATGGAG CGCTTCACCA GTCTCAAAGG ACGCAGGCAA 7 62 0 

ATAGAATATT TAG CT GGTCG CTGGTCGGCT AAGGAGGCCT TTTCCAAGGC TATGGGAACG 7 680 

GGCATTAGCA AGCTCGGTTT TCAGGATTTG GAAGTCTTGA ACAATGAACG TGGGGCGCCT 7740 

TATTTTAGTC AGGCACCATT TTCAGGAAAG ATTTGGCTGT CTATCAGCCA CACCGATCAG 7 8 00 

TTTGTGACAG CCAGTGTCAT TTTGGAGGAA AATCATGAAA GCTAGTCCAC ATAGACCAAC 7 860 

CAAGGCTCTG ATTCATCTGG GAGCTATTCG ACAAAATATT CAGCAAATGG GGGCTCATAT 7 920 

CCCTCAAGGA ACGCTCAAGT TGGCTGTGGT TAAGGCCAAT GCTTATGGTC ATGGAGCTGT 79 8 0 

TGCCGTTGCC AAGGCAATTC AAGATGATGT TGATGGCTTT TGCGTTTCCA AT AT CG ATG A 804 0 
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AGCCATTGAA CTCAGACAAG CTGGACTCAG CAAGCCAATC CTCATTTTAG GAGTTTCTGA 8100 

AATCGAAGCT GTTGCTCTAG CTAAAGAATA TGACTTCACC TTGACAGTGG CTGGACTGGA 8160 

GTGGATTCAA GCACTCTTAG ATAAGGAAGT GGACCTAACT GGATTGACAG TCCACCTCAA 822 0 

G AT TG AT T C A GGGATGGGAC GGATTGGTTT TAG AG AG GC A AGTGAGGTTG AGCAGGCTCA 8280 

AGATTTGCTC CAACAACACG GTGTTTGTGT TGAAGGAATC TTTACCCACT TTGCTACTGC 83 4 0 

TGATGAGGAA TCAGATGACT ATTTTAATGC CCAGTTAGAA CGGTTTAAAA CT ATTT TAG C 84 00 

TAGTATGAAG G AAGTT C C AG AGCTGGTTCA TGCTAGCAAT TCTGCAACGA CTCTTTGGCA 8460 

TGTAGAGACT ATTTTCAATG CGGTTCGTAT GGGAGATGCC ATGTATGGCC TCAATCCAAG 852 0 

TGGAGCGGTC TTGGATTTGC CTTATGATTT GATACCGGCC TTGACCTTGG AGTCTGCTCT 8580 

GGTTCATGTC AAGACAGTTC CAGCTGGAGC TTGCATGGGC TATGGAGCAA CTTATCAAGC 8 640 

GGATAGCGAG CAAGTCATCG CGACCGTGCC AATCGGGTAT G C AG ATGG AT GGACAAGAGA 87 00 

CATGCAAAAT TTCTCTGTCT TGGTAGATGG CCAAGCTTGC CCAATTGTCG GCAGGGTTTC 87 6 0 

GATGGACCAA ATCACTATTC GATTGCCTAA GCTTTATCCG CTAGGAACCA AGGTAACCTT 8 82 0 

GATTGGCTCC AATGGGGATA AGGAAATCAC TGCAACTCAG G TAG C G AC CT ACCGCGTAAC 8880 

CATTAACTAT GAGGTGGTTT GCCTCCTCAG CGACCGTATT CCGAGAGAAT ATTATTAGAA 894 0 

AAGAAAGGAG TGGAGCATGA ATCTACATCA ACCCTTGCAT GTCTTGCCTG GTGTGGGACC 9000 

AAAGTCAGCA GAAAAATACG CCAAACTAGG AATTGAAAAC TTGCAAGATC TCTTGCTCTA 9060 

CTTTCCTTTC CGTTATGAAG ACTTCAAAAC CAAGCAGGTG CTGGAGCTGG AAGACGGTGA 912 0 

GAAGGCAGTT CTTTCTGGTC AGGTAGTGAC TCCTGCTAGT GTCCAGTATT ATGGTTTCAA 9180 

GCGCAATCGC CTGCGTTTTA GTCTCAAGCA GGGAGAGGTC GTTTTTGCGG TGAATTTCTT 92 40 

TAACCAGCCC TATCTGGCTG ATAAAATAGA GTTGGGAGCA ACCCTTGCTG TCTTTGGAAA 93 00 

AT GGG ACCGC GCTAAGGCTA GTCTGACTGG GATGAAGGTT CTGGCTCAGG TAGAAGATGA 93 60 

CCTCCAGCCT GTCTATCGTC TGGCTCAGGG AATCAGTCAG GCCAGTCTGG TCAAGGTCAT 942 0 

CAAGACGGCT TTTGATCAGG GACTGGACCT CTTGATAGAA GAAAATCTGC CCCAGTCTTT 94 80 

ACTAGACAAA T AC AAACT C A TGTCCCGTTG TCAGGCAGTC CGTGCTATGC ATTTTCCAAA 954 0 

GTATTTGGCA GAATACAAGC AGGCTCTTCG CCGTATAAAG TTTGAGGAAC TCTTTTATTT 9 600 

C C AAATGC AG CTGCAGATGC TCAAGTCTGA AAATAGAGTT CAGGGAAGTG GTCTGGTTCT 9660 

GAATTGGTCT CAGGAAAAAG TGACAGCAGT TAAAGTAAGT CTTCCTTTTG CCCTGACCCA 9720 

AGCTCAGGAA AAGAGTTTGC AGGAAATTTT AACTGATATG AAGTCCGACC ACCACATGAA 97 80 
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TCGTCTCCTA CAAGGGGATG TGGGGAGTGG AAAAACGGTA GTCGCTGGCT TGGCCATGTT 9 84 0 

TGCGGCAGTG ACAGCAGGTT ATCAGGCTGC CCTAATGGTA CCAACAGAAA TCCTCGCAGA 9 900 

GCAACACTTT GAGAGTTTAC AGAACCTTTT TCCCAATTTG AAACTGGCTC TCTTGACAGG 9 9 60 

TTCCTTGAAA GCTGCAGAAA AGAGAGAAGT CTTGGAGACC ATTGCCAAGG GTGAGGCTGA 10020 

TTTGATTATA GGAACTCACG C TCTG AT AC A AGATGGGGTG GAGTATGCTC GTCTTGGTTT 10080 

GATTATTATC GATGAGCAGC ACCGTTTTGG TGTAGGGCAA AGGCGTATTT TACGGGAAAA 1014 0 

AGGTGACAAT CCAGATGTCC TCATGATGAC GGCGACTCCC ATTCCACGGA CGCTTGCCAT 10200 

CACAGCCTTT GGAGATATGG ATGTTTCCAT TATCGACCAG ATGCCAGCAG GTCGGAAGCC 10260 

TATTGTGACG CGCTGGATCA AACATGAGCA ACTACCTCAG GTCTTGACTT GGTTAGAGGG 10320 

GGAAATTCAA AAAGGTTCCC AAGTCTATGT CATCTCTCCT TTGATTGAAG AATCAGAAGC 103 80 

TCTAGATTTG AAAAATGCCA TTGCCTTATC AGAGGAGTTG ACGACTCATT TTGCAGGCAA 10440 

GGCAGAGGTG GCTCTTCTAC ATGGTAGGAT GAAGAGTGAC G AAAAAG AC C AGATCATGCA 10500 

GGATTTCAAG GAGAGAAAGA CGGATATTCT GGTTTCGACG ACGGTTATTG AGGTTGGGGT 10 5 60 

CAACGTTCCC AATGCGACTG TCATGATTAT CATGGATGCC GATCGCTTCG GTCTCAGTCA 10 62 0 

ACTTCACCAG CTTAGAGGTC GTGTCGGTCG GGGGGACAAG CAGTCCTACG CTGTTCTCGT 10680 

TGCTAATCCC AAGACGGATT CTGGGAAAGA CCGCATGCGT AT C ATG AC AG AAACGACCAA 10740 

TGGATTTGTC CTTGCGGAGG AAGATTTGAA AATGCGTGGT TCTGGTGAGA TTTTTGGAAC 10 800 

CAGACAGTCA GGACTTCCAG AGTTCCAAGT GGCTGATATT ATCGAAGATT TTCCGATTTT 108 60 

AGAAGAAGCA AGAAAGGTTG CTAGCTACAT TAGTTCTATA GAAGCTTGGC AAGAAGATCC 10920 

AGAGTGGCGC ATG AT T GC C C TTCATCTGGA AAAGAAAGAA CATCTGGATT AAGCTTTCTC 10980 

TAAGGAAAAC TT AT ACT C AA TGAAAATCAA AGAGCAAACT AGGAAGCTAA CCGCAGGTTG 11040 

CTCAAAACAC TGTTTTGAGG TTGTGGATGA AACTGACGAA GTCAGCTCAA AACACCGTTT 11100 

TGAGGTGGCA GAT AG AAC TG ACGAAGTCAG TAACATATAT ATACGGTAAG GCGACGCTGA 11160 

CGTGGTTTGA AGAGATTTTC GAAGAGTATT AAGCTAGTTT TTAGGTTTGG CTCTTATACT 112 2 0 

AG AGT CATC A AAAAGAAACG AGGACTCTCA TATGACAGTA ACGATTAAAG T AAATT AC C A 112 80 

AACCACTTTC CAAAAGAAGG AAGCAAAAAA C T AGT AT AAA CAGAAGAGAG AGCGAAATGC 1134 0 

TCTTTTTTCG TTTCTAAAAC T ACT TT C AG C CCATCATCCT AAAAGTAAAG AATCTAAATT 11400 

CACTTTCTAT TTACCCTTCT TTCTTGCATT GATTACATAG ATATGCTACA GTTGTGGTAA 114 60 

CGAT T AC AAA ATAAAAGGAG CATGCTATGA AAAATCCAGC TTTGCTAGAA GAAATTAAGA 11520 

CCTATAGAGG AAGGGATGAG GTTCCGGAAG ACTTTGATGA TTTCTGGGAT GGGGAAGTGA 11580 
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AAAATGTTTC CACGCTTCCA TCCTACCACT TGGAGGAAAG AGATTTCCAC ATTCCTCAAG 11640 

TCAAGTGCTA TGAGTTAACA TTTGAAGGAA GCAAGGAAGG AAAGGTC T AT GCACGCATTG 117 00 

TTCTTCCAAA GAGTGAGGAG AAGGTCCCAT TAATCTTCCA TTTTCATGGT TATATGGGAC 117 60 

GTGGCTGGGA CTGGGCCGAC ATGCTGGGCT TCACCGTAGC TGGTTACGGT GTTGTTTCCA 11820 

TGGATGTGCG GGGCCAGTCA GGTTACTCAC AAGACGGCTT GCGTTCTCCT TTAGGAAATA 11880 

CCGTGAAGGG G CAT ATT AT C CGTGGTGCTG TGGAAGGTCG GGACCACCTC TTTTATAAGG 11940 

ATGTTTATCT GGATATTTAC CAGTTGGTCG AAATTGTTGC TAGTCTGTCT CAGGTTGATG 12 000 

AGAAGCGTCT TTCTAGCTAT GGTGCCTCAC AAGGAGGGGC TCTAGCTCTA GTTGCAGCAG 12 060 

CGCTCAATCC TCGAATTCAG AAAACAGTTG CCATTTATCC CTTCTTGTCA G ACTT C AG AC 1212 0 

GGGTGATTGA GATTGGTAAT ACTAGCGAGG CTTACGACGA ACTTTTCCGT TATTTCAAGT 12180 

TTCACGACCC CTTCCATGAA ACAGAGGAGG AAATCATGGC GACCCTTGCC TATATCGATG 122 40 

TCAAAAATCT TGCCCATCGT ATCCAAGGTG AGGTTAAGAT GATTACGGGC TTGGACGACG 12 3 00 

ATGTTTGCTA TCCCATTACC CAGTTTGCGA TTTATAATCG TCTGACCTGC GATAAAACCT 12 3 60 

ATCGCATCAT GCCTGAGTAT GCTCACGAAG CCATGAATGT ATTTGTCAAT GACCAAGTCT 1242 0 

ACAACTGGCT CTGTGGAAGT GAGATTCCTT TTAAATATCT AAAATAAGGA GTCGACTCTA 124 80 

AG C AC AAAAT CTTAAAAATT ACAAACACGC ATAGTATCAG GGGATTAAGA AAAC T T TATA 12 540 

CTATGCGTTT TATCATGGAA ATATAGTAAA ATGAAATAAG AACAGGACAA ATCGATCAGG 12 600 

ACAGTCAAAT CGATTTCTAA CAATGTTTTA GAAACAAATG T GT ACT ATT C TAGTGTCAAT 12 660 

C TAT TAT ATT T AT AG AATT T TTTGTTGCTA GATTTGTCAA ATTGCTTAAA ATAATTTTTT 12 720 

TCAGAAAGCA AAAGCCGATA CCTATCGAGT AGGGTAGTTC TTGCTATCGT CAGGCTTGTC 12 7 80 

TGTAGGTGTT AATACTTTTC AAAAATCTCT TCAAACCACG TCAGCTTCGC CTTGC 12 835 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

GGGGATATGA AGAACAAAAG AAT AT T T AAA GACTTCCAAG CTTCAAAAAT GAGTTTAAAC 60 

ATT T AC AC AA GCCCCTTGTT AGCCTTTGTT TTTGTCTTCA TAGGAGAGTT TGTGGCTTTT 12 0 
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ACTTTGTATG GTATTGGCTT GTTAGCTCTC ATCGGACTTG CTAGAAATTT TGGAGAGGCT 180 
GGTCAAAATC TTGCAAGCTA CTTGCAGACC TTGCATCAGA GCTTGACGGA TAAAACAAGT 240 
GACTTTCGTT TAATTTTAGG ATTACTGGCC TTTGGTTATT CTTAACACTG TGTTCAGATG 3 00 

GACAAGAAAA GTTGAGAAAA GACCTATTCG AACCTTGGGA TTTTATAGAG AGAATTTCCT 3 60 

CAGCAATCTT CTGAAAGGAT TTAGTCTAGG CCTGGCACTT TTTCTTCTGA CCTTGTTAGG 420 
TTTAGTGGTC TTAGGTCAAT ATCGTTTGGA ATCCATTCAC TTGAATCCTT ATTCTCTTGC 4 80 

CTTTGTCGTC T T TACT AT C C CATTTTGGAT TTTACAGGGG ACAGCAGAAG AAGTGGTGGC 540 
CCGTGCTTGG CTACTTCCTC AATTGGCCTC AAGAACCAAT CTAAAACTAG CTATTCTTAT 600 
ATCTAGCCTG TTCTTTACCC TG CT TC AT AT GGGCAATTCT GGTCTCACCC CTCTATCTCT 660 
AGTAAATCTC TTTTTATTCG GAGTTGCCAT GGCTCTTTAC CTTCTCAAAA CTGATACAGT 72 0 

TTGGGGTGTT GCAGGTATTC ATGGTGCTTG GAATTTTGCT CAGGGTAATC TCTTTGGGAT 780 
TTTAGTTAGT GGTCAACCGT CAGaACGTCT CTGATGACCT TTTTACCACA AGGCAATCAA 84 0 

GATTGGCTAT CAGGTGGTTC TTTTGGCATA GAAGGTTCCA TTATGACAAG TCTGGTATTA 9 00 

CTACTGCTGA TTGTCTATCT TGCTAATAAA TTAAAGAAAG AAAATGAAAG GATGTGACTT 9 60 

CGGTCCGTCC TTTTCTTCGT GAAAATACTA TAAGTATGCT AAAATAGGAA TAGCACATGG 102 0 

AGAGAGGATT CTTATGATCA AT C AC AT T AC AGATAATCAA TTTAAACTAG TATCAAAATA 1080 

TCAACCATCA GG AG AT C AAC CCCAAGCTAT CGAGCAGTTG GTGGATAACA TTGAGGGGGG 1140 

AG AAAAAGC T C AG AT T C T G A TGGGGGCGAC TGGAACAGGG AAG AC C TATA CTATGAGTCA 12 00 

GGTCATTTCT AAAGTCAATA AACCAACTCT GGTTATTGCC CACAATAAAA CTCTGGCTGG 12 60 

TCAGCTCTAT GGGGAGTTTA AGGAATTTTT CCCTGAAAAT GCAGTTGAGT ATTTCGTATC 13 20 

CTACTATGAT TATTACCAGC CAGAGGCCTA TGTCCCTTCT AGCGATACCT ATATTGAGAA 13 80 

GGATAGTTCT GTCAATGACG AGATTGACAA GCTTCGCCAC TCAGCTACCT CAGCCCTTTT 14 4 0 

GGAGCGTAAT GATGTTATTG TCGTGGCCTC AGTCTCTTGT ATCTATGGTT TGGGTTCGCC 1500 

CAAGGAATAC GCTGATAGTG TCGTTAGTCT CCGTCCTGGT C TAG AG AT T T CTCGTGATAA 1560 

ACTCTTGAAT GACTTGGTCG ATATTCAGTT TGAACGTAAT GATATTGATT TCCAACGCGG 1620 

AAGATTTCGC GTTCGTGGGG ATGTGGTAGA GATTTTCCCA GCTTCCCGAG ATGAACATGC 16 80 

CTTTCGAGTA GAATTTTTTG GAGACGAAAT TGACCGTATT CGTGAAGTTG AGGCTCTGAC 1740 

AGGTCAGGTG TTGGGAGAAG TGGATCATTT AGCGATTTTC CCAGCGACAC ACTTTGTGAC 1800 

CAATGACGAC CACATGGAAG TTGCCATTGC AAAG AT T C AG GCCGAGTTGG AAGAACAATT 1860 

AGCTGTCTTT GAAAAGGAAG GTAAACTGCT TG AAG C C C AG CGTTTGAAAC AG CGG AC AG A 1920 
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GTATGATATC GAAATGTTGC GTGAGATGGG CTATACCAAT GGGGTTGAAA ATTATTCTCG 19 80 

CCACATGGAT GGACGGAGCG AAG GAG AG C C TCCTTATACG CTTCTCGACT TCTTCCCAGA 2 04 0 

TGATTTCTTG ATTATGATTG ACGAGAGTCA TATGACCATA GGGCAAATCA AGGGCATGTA 2100 

CAATGGAGAC CGTTCGCGTA AAGAAATGCT GGTTAATTAT GGTTTCCGTT TGCCGTCTGC 216 0 

TTTGGACAAT CGTCCTCTCC GTCGGGAGGA GTTTGAGAGT CACGTTCATC AGATTGTTTA 22 20 

CGTTTCAGCG ACACCTGGTG ACTATGAAAA TGAACAGACC GAG AC AG T G A TTGAGCAAAT 22 80 

CATTCGTCCA ACGGGACTCT TGGATCCAGA GGTGGAAGTC CGTCCGACTA TGGGACAGAT 2 34 0 

TGATGACCTC TTGGGTGAAA TCAATGCCCG CGTTGAAAAA AATGAGCGTA CCTTTATCAC 240 0 

AACTTTGACC AAGAAAATGG CAGAGGATTT GACCGACTAC TTCAAGGAAA TGGGTATCAA 2 4 60 

GGTCAAGTAC ATGCACTCGG ATATCAAGAC CTTGGAACGG ACGG AG AT T A TCCGTGACCT 252 0 

GCGCTTGGGT GTCTTTGATG TCTTGGTCGG AATTAACCTG CTCCGTGAAG GAATTGACGT 258 0 

TCCTGAAGTG AGCCTCGTAG CTATTCTCGA TGCTGACAAG GAAGGTTTCC TTCGCAACGA 264 0 

ACGTGGACTC ATCC AG AC C A TTGGACGTGC TGCACGTAAT AGCGAAGGTC ATGTTATCAT 2 700 

GTATGCGGAC ACGGTTACCC AGTCTATGCA ACGTGCTATC GATGAAACTG CCCGCCGTCG 2 7 60 

CAAAATCCAG ATGGCCTATA ATGAAGAACA TGGTATCGTT CCACAAACCA TCAAGAAAGA 282 0 

AATCCGTGAC TTGATTGCTG TGACCAAGGC AGTTGCTAAG GAAGAAGACA AGGAAGTCGA 28 8 0 

TATCAATAGC CTCAACAAAC AAGAGCGCAA AGAACTAGTC AAAAAGCTTG AGAAACAAAT 2 940 

GC AAG AAG C A GTTGAAGTGC TTGACTTTGA ACT AGC AGC T CAGATTCGTG ATATGATGCT 3000 

GGAAGTCAAG GCCTTGGATT AGGGGAATAG TATGATTTAT TTAAGAAAGT TAAAGAAAGA 3 0 60 

AGATTTGATG TCTTTATGGG AAATGGCTTA TTCACAGCTT AATCCAGTTT GGAAACAGTA 3120 

TGATGCTCCC TATTATGATG AT TAT C AGT A TTTTTCAAAT TTTAAAGAAT TCGAACTACA 318 0 

AAAATCAGAA TCCATTTTAA GCAACTCAAA TCGCCTTGGT ATTTTTGTTG ATGATAAACT 3240 

AGTTGGGACT GTTTCGCGTT ATTGGGTATG TAAAGAAACA AGATGGATGG AATTGGGAAT 3 3 00 

TGGTATTTAT GATAAAAAAT TCTGGAACAC TGGTATTGGG AAAGTTGCTA TGTTGCAGTG 3 3 60 

GATAGATAGG ACGTTTCAGG ATTACTTGGA GTTGGAGCAT CTGGGTTTGA CAACTTGGTC 342 0 

AGGAAATATT GGTATGATGA AACTTGCTGA AAAATTAAGA ATGAAAAAAG AAGCTCATAT 3480 

TCCAAAAGTT CGTTATTATC AAGGTAAATA TTTTGACAGT ATTAAATATG GTATTTTGAG 354 0 

AGAAG AC TGG GAGAAAATAA ATGACGGTTA TTATCAAATC AATGGAAACT C C T G AAG AG A 3600 

TAGAAGGTAA ATCCTTCGTT CACTGGCAAA CGTGGAGAGA GGCTTATGAT GACCTTTTGC 3 66 0 
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CTGCGGAATT T C AGG AG AC A AT G AC AT TAG AAAGATGTCG ACTCTTTAGT CAAAAGTATC 3 72 0 

CAGAAAATAC ATTGATTGCG ATGGATGGTG TGAAGATAGT TGGTTTTATA AGTTATGGCA 3 7 80 

ACTGTCGTGA TG AG ACT AT T CAAGCTGGTG AAATTATTGC TTTATATGTT TTAAAAGACT 3840 

ATTATGGAAA AGGAATCGCA CAAAAGTTAG TGAAAGCAGC TTTGACTGAT CTTAATCATT 3 900 

TTTCTGAAAT TTTCTTATGG GTATTGAAAG ATAACAAGCG CGCCATTGCT TTCTATCAAA 3 960 

AAATGGGTTT TACTTTTGAT GGACAAGAAA AAATACTTGA ACTTGGAAAG CCTATAAAGG 4 020 

AAAAACGGAT GGTATTCTAT TCTAAATAAT TCTCAAAAGT AAAAGCTAAT ATGGTACCAA 408 0 

GTCTGAAAAT TTAATAAATT AGAAAG CG AG TAAATTTATG TCCCGTTCCC AATTAACAAT 4140 

TTTAACAAAT ATCTGTCTGA TTGAAGACCT CGAAACTCAG CGCGTGGTGA TGCAGTATCG 42 00 

CGCCCCTGAA AACAATCGCT GGTCTGGTTA TGCCTTTCCT GGAGGTCATG TAGAAAATGA 42 60 

TGAGGCTTTT GCGGAGTCTG TCATTCGTGA AATCTACGAA GAAACAGGGT TGACTATCCA 4320 

AAATCCTCAA CTTGTCGGCA TTAAAAATTG GCCACTAGAT ACAGGTGGGC GCTATATTGT 4 380 

CATTTGTTAT AAGGCGACTG AGTTCTCTGG TACCCTTCAA TCTTCAGAAG AGGGAGAAGT 4440 

TTCTTGGGTG C AAAAAG AC C AG ATT C C AAA CTTAAATCTG GCCTATGATA TGCTACCATT 4 500 

GATGGAAATG ATGGAAGCTC CCGACAAGTC AGAGTTTTTC TACCCTCGCC GTACAGAAGA 45 60 

CGATTGGGAA AAGAAAATCT TCTAGTCTTT TACTAAATAA C C T AG C TG AT CCAAGGCCTC 4 62 0 

CTCGATATAG TGGAGGTCTT GTTGTGTCTC GGCTTCAACT AGGTGATAAT GAATACCATC 4680 

TGTTAACTCA GAAATTGGCT TAAAGTCAGA ACGTTCAACT TGTTCTAGAA AATGTTGCAC 4740 

GTCGCGGCGA CAGGTCAGTT TTAGTAAGGT TTCAATCTCT CCATAAACAG GATGATCAAT 4 800 

CAAGATATTT TGAACGCGAC CACCATTATC TACGATAGCA AGTAATTCTC GTCCAATTTC 4860 

TTCAACTTCA TGCTTGACCT TAAATAATTT GTGATGATAA GTATTTGCAT TAG C ATCT T T 4 920 

ATAGATATAA CCACGATTGG TAGATAGAAT TGGAGATCCA TCAGCTCTTA AAATTGCAAT 4 9 80 

ATCTTGAACA ATAACTTGTC GAGTGACATG AAAGTGCTCA 5 02 0 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 3: 
AAAAAGTGGC AATCCATTGA TTGGCCACTT CATTTAGAGA ATTATCGTCT CGCCCTTGAA 60 
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GAAGAAGGTC GTGTAGTACT TGAGTTACTG C T AT CG C TAG AACTACTACT TGAACTGCTG 12 0 

GAGCTGGATG GAGTTGGTAG ACTCCCCACA ATACTAGACC AAGCATTCTG ATAATCCGCA 180 

TCACTTCCGC CAATAGCAAA GCGATAACTT GTCGCTGGCG CTCCTGACTT ATTAGCCCAA 240 

TAGCTGGTAA CAGTCGAACC TGTGACCTCT ACTTCTTTTC CTTCAACAGA AACCTTCTCT 3 00 

GGTTTTTGAC CTGTTGATTT CAAGACTTCC GATTTCACTA CACTAGGATC TAAAGCAAAG 3 60 

CGCTCGTTCC CCCAAATGCT TGGGGAAGCT TGCTGAATCG CAT T T AC C AG ATGAGCCATG 42 0 

TAATTAGAGT TATTAGAATA ACCTGCTCTA CGTGACAATG AATGATTATC ATCATGCCCA 4 80 

ATCCAGCCAC CTAGGGTTAA TCTAGGTGTC GAAAGCATGA GCCACATATT TTCGTCTTGG 54 0 

TTGGTTGTAC CAGTCTTCCC AATCCAATCT GCATTAGCCA GAGTAGGATT TAAAGAAGTC 600 

AGGTTAGACT TGAAGGTTGT T GT C AC AC G A GAGGATAGAA CTTCTCGTAG CAATCCCTGC 660 

ATAATCGTCG CAGTAGCTTT TGAATAGACT TGAACCGGTT TATCCTGATA CTCATACACC 72 0 

ACTCTACCAT CTGCTGCTTC AATCTTTGAA ATCACATGCT TCTG AT GAT A AACTCCATTA 7 80 

TTAGCTAAGG TCTGATAGCC ATTGGTATGC TGGGCAACTG TGACTTCAAT ACCACCACCC 84 0 

ATTGGCAAGC TCTCAATACC GTACTCAGGA ATCTCGTAAC CCATCTTTTC C AT AT AAC C C 900 

TTGACATCAA CACCCTTTTC ACGGAGCATA CGATAGGTCC AGTAAGCAGG GATATTCCAT 9 60 

GAATAGTTCA GAGCTTCTCC CAAGGTCATC ATTCCTGTTC CCTTGCTATT AG CAT AC AT A 102 0 

ATCGGATTGC CATTAGCAAA GTTTGTTGGA TAGTTAGATA GAATCGTTTC ACTTCCCATC 1080 

AAGCCCTGGT CAATAGCAAT ACCGTAGGCC AGCAAGGGCT TGGTAGTAGA AGCTGGCGAA 1140 
t 

CGTTTGGTAT CAAAGGCATG ATTATTTTGA TTTTCTTGAT AATTACGACC ACCTACAAAG 12 0 0 

CCTAGAATAG CACCTGTTTG GT T AT CC AT C AAGACATTCC CTACTTCTAC ACGACCTGTT 12 60 

CCATCGTCTA AAAGATAGCC ATAATCAGCA ACCGCACTTT GCATGGCAGA ATGAATTTTC 132 0 

TGATCTATGG TAGTAGTAAT CTTATAACCA CCATTTTCAA TTTCCTTGGC TGCCAAATCT 13 80 

CGATAAAACT TCTGAGTTGC CTCATTTTTC AACTCCTTAG CGGAGACATT GTCTCTCTGA 14 4 0 

GCTAGATAGT CATACATACG TTCTTGAGCT TCTGCCAAAG TTGTAAAGTA TAAATAGTCT 15 00 

CGTGAAATTC CTGTAACCGT GCCCGATGGT AAAAAGTCCT GTTTAAGGTC ATAATCCTTG 156 0 

TACTGAGAAT ACTCGTCTTT GCTTAATGCA CCTGTACGAT AC AT AC TGT A AAGAACTGCC 1620 

TTAGCCCGTC TTAAGCCAAT TTCTAGGTCT TCATCACTCT TCAACTCCCC AGTATTTTCA 16 80 

TAAGGAGAGT AAGTAATGGG ACTCTGTGGA AGTCCTGCTA AAAATGCTGC TTGAGGAACA 1740 

GTCAACTGAC TGGCATCTAC ACCGAAAATT CCCTCAGCTG CTTGCCGAGC CCCTGCAATA 180 0 
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TTCTGTCCCT TATTATTTCG GCCAAAGGGA GCCACATTGA GATAGGTCGT TAAAATCTCA 18 60 

TCTTTATTCA TGGCGCGTTC CAAGGCAAGA G CAT C C AC AA TCTCTGCCGC CTTACGAGCC 192 0 

AAGGTCGGCG CATCCCCAAC CACCTGCTGT TTAATTAGTT GCTGGGTCAA GGTTGAACCC 19 80 

CCACTAGAGG AACCCAAACC TACAAATTTC CCCAAGGTCG CACGAATCAC CGCCTTGGGT 2 04 0 

ACTACACCCT TATGTTCTTT AAAGTGTTCA TCTTCTGTCG CAATGATAGC CTTCTTCAGA 2100 

TTTTCCGAAA TTTGCTCAGA TGAGATAGAA GTGCGCAACA AATCACTCTC TATGGAAGCA 2160 

ATCACCGTCC CGTCCGAATA GGTAATCTCT GAAATAGAAG AGATGTCCTT GACCTGATTC 2220 

ACCAATTCTT CTGTCTGAGG CACCCGAACC TTGTCAAATA AGGCCACTCC GTATCCCAAA 2 2 80 

GCAATCCCAG CTCCCAACAT TCCTCCTAGA AAACCGAGTA CAAAGAGTAA GTTAAATAAG 2340 

GCTTTTATAC TCAGTAAAAT AG C TGGG AAA ATGACTGACT TATCTAAGGT TTTAGATTTT 2400 

TTGGTACTTG AACCTTTCTT GCCAGGTCTA GCTGATTTTT TATTTTTTTG TTTTTGCTGG 24 60 

AAAAATTCCA GCATTTTTCG TTTTAATTCA TTTAATTGAT TTTGCATGGA TTTCCTCACT 2 52 0 

TTATCTATTA TACCACAAAA GGGAAATTTT CAATAAAATA GCCACTTTCT TCCCTATTCT 2 580 

GCTAGGCTAT TGCCCAAGTT T GT G AT AC AA TAGGTAGAAA CAATAATTTT AAAAAGGAGA 2 640 

AAAAACACAT GCACATTTTT GATGAGCTAA AAGAGCGTGG TTTGATATTT CAAACGACTG 2 700 

ATGAAGAAGC TTTGCGTAAA GCCCTAGAAG AAGGTCAAGT TTCTTATTAT ACTGGCTACG 2 7 60 

ATCCAACTGC TGACAGCCTT CACCTAGGCC ACCTTGTCGC AATCTTGACA AGTCGTCGCT 2 82 0 

TGCAACTAGC AGGTCACAAA CCTTATGCGC TCGTTGGCGG TGCTACAGGT CTCATCGGAG 2 8 80 

ATCCGTCCTT CAAAGATGCT GAACGTAGTC TCCAAACAAA AGACACAGTA GATGGCTGGG 2 940 

TCAAGTCTAT CCAAGGACAA CTTTCTCGTT TTCTTGACTT TGAAAATGGC GAAAACAAGG 3 000 

CTGTCATGGT CAACAACTAC GACTGGTTTG GC AG CATC AG CTTCATTGAC TTCCTCCGTG 30 60 

ATATTGGAAA ATACTTCACG GTCAACTACA TGATGAGTAA GGAATCTGTT AAAAAACGGA 312 0 

TCGAAACAGG AATTTCTTAC ACTGAGTTCG CTTACCAAAT CATGCAAGGG TATGACTTCT 3180 

TCGTCCTTAA CCAAGACCAT AATGTCACTC TTCAAATCGG TGGTTCTGAC CAGTGGGGAA 32 40 

AT ATG AC AG C TGGTACCGAA TTGCTTCGTC GTAAGGCGGA CAAGACTGGT CACGTTATCA 3 300 

CTGTTCCACT AAT C AC AG AT GCAACTGGTA AGAAATTTGG TAAATCAGAA GGAAATGCCG 3 3 60 

TCTGGCTCAA TCCCGAAAAG ACTTCTCCAT ACGAAATGTA CCAATTCTGG ATGAACGTGA 3 42 0 

TGGACGCTGA CGCTGTTCGC TTCTTGAAAA TCTTTACTTT CTTGTCACTT G ATG AG ATT G 3 4 80 

AAGATATTCG TAAACAATTT GAAGCAGCGC CACACGAACG CTTGGCTCAA AAAGTCTTGG 3540 

CTCGTGAAGT TGTT AC AC T T GTTCACGGAG AAGAAGCCTA CAAAGAAGCA CTTAACATCA 3 600 
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CTGAGCAACT CTTTGCAGGA AACATCAAAA ACCTTTCTGT CAAAGAGCTC AAACAAGGAC 3 660 

TTCGTGGTGT GCCCAACTAC CAAGTACAGG CAGACGAAAA CAACAATATC GTGGAACTGC 3 720 

TCGTCTCATC TGGTATAGTT AACTCAAAAC GCCAAGCCCG TGAAGACGTC CAAAACGGAG 3780 

CCATCTACGT AAACGGCGAC CGCATCCAAG AGCTTGACTA TGTCTTGAGT GACGCTGATA 3 840 

AGTTAGAGAA TGAACTGACT GTTATCCGTC GTGGGAAGAA AAAATACTTT GTATTGACTT 3 900 

ACTAAACTAT TCAACATTTA TCTATAAACA AAGGAGTTAA CCTCGAGAAA GGTAACTCCT 3 960 

TTTTGCTGTT AATAACTCTC ATCTATCTAT TTTTAATAGA CAGGCTACGC AGGACAATGC 4 020 

GCAAGGTTGT TAGATTATGT AAGATAGAGA GATTTGAAGG ACTGAACCAA TTAAATAAGC 4080 

CAAAGCCAAT CAAACTACTA TTTACGACAA CGGTATCCTG AATATTTTTC TTGATGAGTG 4140 

TTTGCAAAGA TGATGATAAC GAATCCAACT CTTGGAAGAA AT C C AAACG A TTATCTAACA 42 00 

AT AAG AT AT C ACTCATCTGC T T AGAAAT AT CTGCACTCTC ATT CATC AC C ACACCGATAT 4 2 60 

CTGATAGAGT TAAAGCCGCT GAGTCATTCA ATCCATCTCC AAC CATC AAA AT AGT GTG AC 4320 

CTGCTTTCTG CAGTTTCTCT ACTAACTCAA ATTTCCCATC AGGTTTCAAG TCTGTATAGA 43 8 0 

CCTGATCAAA GGGCAAATCT TTGACTAATT CCTCTGTCCT AATCAAGGTG TCTCCTGTTG 4440 

CCAGAATCAA TTTTTTCCCC TGTGCCTTAA GTTTATCCAA GGCTGTTTTT GCTTCTTTTC 4 5 00 

TCAAAGGAGT ATGAATGCAG AACATTCCAA TCAATTCATT TTGATAAGCC AAGAATAAGA 4560 

GATTGTAGTG ACTCTTGTAC TCTTCAATTA AAG C ATT T T G TTCTGAACTG ATATGAATCT 4620 

GCTCATCCTG CATCAAGACA TAATTCCCAA TAAGAACTGG TTGGCCATCT ATATGAGATT 46 80 

TGATCCCCTT GCTTGCGATA TATTGGAGTT TCCCATGCAT TTCCTCATGT TCAATTCCCT 4 740 

CTATCTCAGC TTGCTTGACG ATGGCATTAG CAATAGGATG ATAAATGTGT TCCTCAAGAC 4 800 

AGGCACTGAT TCTGAGAATA TCTTCCTCAC TATAGTCTCC AAAAGGTAAC ACCTTTTCAA 48 60 

C T AT AGGAT A ACTAGTTGTG ATTGTTCCTG TCTTATCAAA CAAGAAAGTA TCAACTTCCA 492 0 

GATATTTCTC CCTGTTGTGG CCTCTGGCTG TCATCTCTGT GCTGG 49 65 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
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CAGGGGCGTA TTACGTGACA ATTCAATGTA GGCTGTCGCT ACTTGCGCCA AAACAAGGAT 60 
TCGATAATGT CGGATGATAC TAACGATTAA ACCGAGCAGA AAGGATCCCA AAATTCCCCA 12 0 

AACTGCAATA TGCAAGGTCA GAAAGAATGC CTTTTGATAT AGTGG T AG AT ATTGTTCAAC 180 
AATGGATCAA TCCAAAAATA GAACCTCCCA TCTAGAAATA AT AC AG T T AT TGTAGCACTT 24 0 

AAAATCTTCT TTGGATAATA TCTATTTTTT ATTGCCGTTA TAAGGATTTT TAT CAT AG AC 3 00 

AT AAAATT T C TGAAATTTCC AAACAAAATA TTTTAAAAGT TTTGAAAAAG AGTTAAGATA 3 60 

TTTTTGTAAT ACACAAAGTA AACGCTTACT TATTAAGGAG G AC ATT T TAT GT CAT AC AAA 42 0 

ACAAGCAATG CAGAAGGTCA TGTAGATTTC ATCAATACCT ATGATTTGGA GCCAATGGCG 480 
CAACAAGTTA TTCCTAAAGC AGCATTTGGC TAT AT C G C T A GTGGGGCGGG AG AT ACTTT C 54 0 

ACTTCTTTCC AGTGATTTTA GCGTCAGGTT CTTTTTAGTT TTTAAAGATT ATCCGTGAAT 6 00 

TTCTTGCTTA TTTATGATAA AATGGGAGTG TCGCAAAAAA TGACTCATCG TATTCAATTT 660 
TGAGTAAAAC TAGGAGGATC CCATGTCTAC AGAACATATG GAAGAACTAA ATGACCAGCA 72 0 

GATCGTTCGC CGTGAAAAAA TGGCTGCGCT CCGCGAACAA GGAATCGATC CTTTCGGAAA 7 80 

ACGTTTTGAA CGTACTGCAA ATTCACAAGA ATTAAAAGAT AAATATGCCA ACCTCGATAA 84 0 

AGAACAATTA C AC G AT AAAA ACGAAACAGC TACTATCGCA GGACGCTTGA TAACCAAACG 900 
TGGTAAAGGA AAAGTTGGTT TTGCCCACCT TCAAGACCGC GAAGGCCAGA TTCAGATCTA 9 60 

CGTTCGTAAG GATGCTGTCG GTGAAGAAAA CTACGAAATC TTCAAAAAAG CAGACCTTGG 1020 

TGACTTCCTT GGTGTCGAAG GT G AAGTG AT GCGTACGGAT ATGGGAGAAC TCTCTATCAA 1080 

GGCAACCCAC ATCACACACT TGTCTAAGGC TCTTCGTCCT CTTCCTGAGA AATTCCATGG 1140 

TTTGACAGAC GTTGAAACAA TTTACCGTAA ACGTTACCTT GACTTGATTT CTAATCGTGA 12 00 

AAGCTTTGAA CGCTTTGTCA CTCGTTCAAA AATCATCTCT GAAATCCGTC GTTACCTTGA 12 60 

CCAAAAAGGA TTCCTTGAAG TGGAAACACC TGTTCTTCAT AATGAAGCCG GTGGTGCTGC 132 0 

TGCCCGTCCA TTTATCACCC AC C AC AATG C CCAAAACATT GACATGGTGC TTCGTATCGC 13 80 

GACTGAGCTT CACTTAAAAC GCCTTATCGT GGGTGGTATG GAACGTGTCT ATGAAATTGG 1440 

CCGTATCTTC CGTAACGAAG GAATGGACGC TACTCATAAC CCTGAGTTCA CTTCTATCGA 1500 

AGTTTACCAA GCTTATGCAG ACTTCCAAGA CAT C AT GG AC TTGACTGAAG GCATTATCCA 15 60 

ACACGCTGCT AAATCAGTCA AAGGTGATGG CCCAGTCAAC TACCAAGGTA CTGAAATCAA 1620 

GATTAACGAA CCATTTAAGC GTGTTCATAT GGTGGATGCT AT C AG AG AAA TTACTGGTGT 16 80 

CGATTTCTGG CAAGACATGA CTTTGGAAGA AGCTAAAGCT ATCGCTGCTG AGAAGAAAGT 1740 

TCCAGTTGAG AAAC AC T AC A CTGAGGTTGG T C AC AT C AT C AATGCCTTCT TTGAAGAGTT 1800 
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TGTTGAAGAA ACTTTAATCC AACCAACCTT TGTCTATGGA CATCCAGTAG CTGTATCTCC 1860 

ACTCGCTAAG AAAAATCCTG AAGACCAACG CTTTACTGAC CGTTTCGAGC TCTTTATCAT 192 0 

GACTAAGGAG TACGGTAATG CCTTTACTGA GTTGAACGAC CCAATCGACC AACTTAGCCG 1980 

TTTTGAAGCC CAAGCTAAAG CCAAAGAACT TGGTGATGAT GAAGCGACAG GAATCGACTA 2 040 

TG ACT AC AT T GAAGCTCTTG AATACGGTAT GCCACCAACA GGTGGTTTGG GAATCGGTAT 2100 

CGACCGTCTC TGCATGCTCC TC AC TG AT AC AACAACTATC CGTGATGTAT TGCTCTTCCC 2160 

AACAATGAAA TAAATTCTTA TCCTCTGGGT C T T AT C AG AG GATTTTTTGA TTCAAAAAGA 2220 

GACTGAATTT AAGGAGAAAA TGAAGTGTAG TATATTGAAA TTGAAATAGT ACACTTTGAT 22 80 

TTCTAAGACA T TGT TAG AAA TTGGTTTAAA TTCCCTAAGC AATTTGTGCA TGTTTTATTT 2 340 

CATTTTACGA TAGTACGCTG AAACTTTTCA AAAAGTACTA G AAATTG AC T TGGATTCCCC 24 00 

AATTGATTTG TTCAGATTCA CTATAAATAA AAAATTAATA AGTGGGATAG GAAGTTAGCG 2 4 60 

TCAACTAGGA TAGTATCTTG CTTAAACAGT ATATATGGGA TTGATATAAG TCCATAGGTC 2 52 0 

CTATTAGAGG ATGTTCTGGT GTCTTATTCA CTTGTTTTTT ATAGTATTAG TAGATAGAAT 258 0 

CAGCAAATAA AAACCCAAAT CATTCATACC TCTCTCAACT AGATGTAACT TACAAAACCC 2 64 0 

CTGACCTCAT GAG C C ACTTT CTTCCTCCTC ATGAGGTCAG TTTTACTTTC TGCTGTTCCA 2 7 00 

GTATCGTTTT TCCTCGCTAG ATTTCCTCAA AAGGGCAGAC TCCTCCCTTG GTGCGTCACA 2 7 60 

CGATTTTTTC ATCTCGACTG TTCTTTAATG CATCATTAAC GACGCTTTTC TTCTAGGTGG 282 0 

TTCATAAGGA ACAGGAAGAT TCAGGTTGAC TTTTCTAATC C T AG AAT AAA GTGCTGAAAA 2 880 

CAATTCGGAA TAGGCATAGA GACTAGACAA TTTGAGGAGC TGCTTGCGTC CTGTTCGAAC 2 94 0 

ACATTTTCCC ACCACGTGAA G AAAAAG AT G GCGGAAGCGT TTGATTGTTA AAGTTTGGAA 3000 

GTCACCTCCA GCTAGATGTT TGAGAAAAAG ATAGAGATTG TAGGCGATAC AGCTCATCAT 3 060 

CATACGAACT TCGTTTTTGA TTAAGGTTGA ACTATCCGTT TTATCGCCAA AAAATCCCTC 3120 

CTTCATCTCC TTGATGAAAT TCTCGGCTTG ACCACGTCCA CGATAAAGCT GAAACTGGTC 3180 

TTGGCTTGTT CCACTCGTCA TATTTGTAAC GAGAGAAATA ACATCGTAGA AC 3232 

(2) INFORMATION FOR SEQ ID NO: 145: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10711 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 5: 

CCGGAGAAAA TGATGAAAAG TTCAAAACTA TTTGCCCTTG CGGGCGTGAC ATTATTGGCG 60 

GCGACTACTT TAGCTGCATG CTCTGGATCA GGTTCAAGCA CTAAAGGTGA GAAGACATTC 12 0 

TC AT AC AT T T ATGAGACAGA CCCTGATAAC CTCAACTATT TGACAACTGC TAAGGCTGCG 180 

ACACAAATAT TACCAGTAAC GTGGTTGATG GTTTGCTAGA AAATGATCGC TACGGGAACT 2 40 

TTGTGCCGTC TATGGCTGAG GATTGGTCTG TATCCAAGGA TGGATTGACT TACACTTATA 3 00 

CTATCCGTAA GGATGCAAAA TGGTATACTT CTGAAGGTGA AGAATACGCG GCAGTCAAAG 3 60 

CTCAAGACTT TGTAACAGGA TTAAAATATG CTGCTGATAA AAAATCAGAT GCTCTTTACC 42 0 

TTGTTCAAGA AT C AAT C AAA GGGTTGGATG C CT ATGT AAA AGGGGAAATC AAAGATTTCT 4 80 

CACAAGTAGG AATTAAGGCT CTGGATGAAC AG AC AGTT C A GTACACTTTG AACAAACCAG 540 

AAAGCTTCTG GAATTCTAAG ACAACCATGG GTGTGCTTGC GCCAGTTAAT GAAGAGTTTT 600 

T G AAT TC AAA AGGAGATGAT TTTGCCAAAG CTACGGATCC AAGTAGTCTC TTGTATAACG 660 

GTCCTTATTT GTTGAAATCC ATTGTGACCA AATCCTCTGT TGAATTTGCG AAAAATCCGA 72 0 

ACTACTGGGA TAAGGACAAT GTGCATGTTG ACAAAGTTAA ATTGTCATTC TGGGATGGTC 7 80 

AAG AT AC C AG CAAACCTGCA GAAAACTTTA AAG AT GGT AG CCTTACAGCA GCTCGTCTCT 84 0 

ATCCAACAAG TGCAAGTTTC GCAGAACTTG AGAAGAGTAT GAAGGACAAT ATTGT C TATA 900 

CTCAACAAGA CTCTATTACG TATCTAGTTG GTACAAATAT TGACCGTCAG TCCTATAAAT 960 

AC AC AT C T AA GACCAGCGAC GAACAAAAGG C AT CG AC T AA AAAGGCTCTC TTAAACAAGG 102 0 

ATTTCCGTCA GGCTATTGCC TTTGGATTTG ACCGTACAGC CTATGCCTCT CAGTTGAATG 1080 

GACAAACTGG AGCAAGTAAA ATCTTGCGTA ATCTCTTTGT GCCACCAACA TTTGTTCAAG 114 0 

CAGATGGTAA AAACTTTGGC GATATGGTCA AAG AG AAAT T GGTCACTTAT GGGGATGAAT 12 00 

GGAAGGATGT TAATCTTGCA GATTCTCAGG ATGGTCTTTA C AAT C C AG AA AAAGCCAAGG 12 60 

CTGAATTTGC T AAAG CT AAA TCAGCCTTAC AAGCAGAAGG AGTCCAATTC CCAATTCATT 1320 

TGGATATGCC AGTTGACCAA ACAGCAACTA CAAAAGTTCA GCGCGTCCAA TCTATGAAAC 13 80 

AATCCTTGGA AGCAACTTTA GGAGCTGATA ATGTCATTAT TG AT AT TC AA CAACTACAAA 144 0 

AAGACGAAGT AAAC AAT AT T ACATATTTTG CTGAAAATGC TGCTGGCGAA G ACTGGG AT T 15 00 

TATCAGATAA TGTCGGTTGG GGTCCAGACT TTGCCGATCC ATCAACCTAC CTTGATATTA 15 60 

TCAAACCTTC TGTAGGAGAA AGTACTAAAA CATATTTAGG GTTTGACTCA GGGGAAGATA 162 0 

ATGTAGCTGC TAAAAAAGTA GGTCTATATG ACTACGAAAA AT TGGTT AC T GAGGCTGGTG 1680 

ATGAGACTAC AGATGTTGCT AAACGCTATG ATAAATACGC TGCAGCCCAA GCTTGGTTGA 174 0 
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CAGATAGTGC TTTGATTATT CCAACTACAT CTCGTACAGG GCGTCCAATC TTGTCTAAGA 18 00 

TGGTACCATT TACAATACCA TTTGCATTGT CAGGAAATAA AGGTACAAGT GAACCAGTCT 18 60 

TGTATAAATA CTTGGAACTT CAAGACAAGG CAGTCACTGT AGATGAATAC CAAAAAGCTC 1920 

AGGAAAAATG GATGAAAGAA AAAGAAGAGT CTAATAAAAA GGCTCAAGAA GATCTCGCAA 1980 

AACATGTGAA ATAACTGTTG CAAAATATAA GAAAGGATTT AGTATTTCCC TTGAATGCTG 204 0 

AATCCTTTTT TACATTTGTA AAGAAAGATT CTAAAATGTA CGGACCCCCA AAAGTTGGAG 2100 

CCTCTTTTTG TCAGAATAGA GAAAATTTTT GTTAATTTTA CTTGTTTCCT ATTGCTTTCT 2160 

CAGCTATTAT TTGTTATATT AAAAGTATAA TTATTTTTTA TTTATCAGAG TTAAGCATTG 2 22 0 

CACTTTCAGA GGAAGGAGTA TTTTTTAAAA AGAAAATGTA AACGTTTGCT CAAAAATGAA 22 8 0 

AGGATTTAGA AGTTTATGAA TAAAGGATTA TTTGAAAAAC GTTGTAAATA TAGTATTCGG 2340 

AAATTTTCAT TAGGTGTTGC TTCTGTTATG ATTGGAGCTG CATTCTTTGG GACAAGTCCG 24 00 

GTTCTTGCAG ATAGCGTGCA GTCTGGTTCC ACGGCGAACT T AC C AG C TG A TTTAGCTACT 2 4 60 

GCTCTTGCAA CAGCAAAAGA GAATGATGGG CGTGATTTTG AAGCGCCTAA GGTGGGAGAA 252 0 

GACCAAGGTT CTCCAGAAGT TACAGATGGA CCTAAGACAG AAGAAGAACT ATTAGCACTT 25 80 

GAAAAAGAAA AACCGGCTGA AGAAAAACCA AAAGAGGATA AACCTGCAGC TGCTAAACCT 2 640 

GAAACACCTA AGACGGTAAC CCCTGAATGG CAAACGGTAG CGAATAAAGA GCAACAGGGA 27 00 

ACAGTCACTA TCCGAGAAGA AAAAGGTGTC CGCTACAACC AACTATCCTC AACTGCTCAA 27 60 

AATGATAACG CAGGCAAACC AGCCCTGTTT GAAAAGAAGG GCTTGACCGT TGATGCCAAT 2 82 0 

GGAAATGCAA CTGTTGATTT AACCTTCAAA GATGATTCTG AAAAGGGCAA ATCACGCTTT 2 880 

GGTGTCTTTT TGAAATTTAA AG AT AC C AAG AATAATGTTT TTGTCGGTTA TGACAAGGAT 294 0 

GGCTGGTTCT GGGAGTATAA ATCTCCAACA ACTAGCACTT GGTATAGAGG TAGTCGTGTT 3000 

GCTGCTCCTG AAACAGGATC AACAAACCGT CTCTCTATCA CTCTCAAGTC AGACGGTCAG 3060 

CTAAATGCCA GCAATAATGA TGTCAATCTC TTTGACACAG TG ACT C T ACC AGCTGCGGTC 312 0 

AATGACCATC TTAAAAATGA GAAGAAGATT CTTCTCAAGG CGGGCTCTTA TGACGATGAG 3180 

CGAACAGTTG TTAGCGTTAA AACGGATAAC CAAGAGGGGG TAAAAACAGA GGATACCCCT 3240 

GCTGAAAAAG AAACAGGTCC TGAAGT TG AT GATAGCAAGG TGACTTATGA CACGATTCAG 33 00 

TCTAAGGTCC TCAAAGCAGT GATTGACCAA GCCTTCCCTC GTGTCAAGGA ATACAGCTTG 3 3 60 

AACGGGCATA CTTTGCCAGG ACAGGTGCAA CAGTTCAACC AAGTCTTTAT CAATAACCAC 3420 

CGAATCACCC CTGAAGTCAC TTATAAGAAA ATCAATGAGA CAACAGCAGA GT ACT TG ATG 34 80 
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AAGCTTCGCG ATGATGCTCA CTTAATCAAT GCGGAAATGA CAGTACGCTT GCAAGTTGTA 3540 

GACAATCAAT TGCACTTTGA TGT G ACT AAG ATTGTCAACC ACAATCAAGT CACTCCAGGT 3 600 

CAAAAGATTG ATGACGAAAG CAAACTACTT TCTTCTATTA GTTTCCTCGG CAATGCTTTA 3 6 60 

GTCTCTGTTT CTAGTAATCA AACTGGTGCT AAGTTTGATG GGGCAACCAT GTCAAACAAT 3 720 

ACGCATGTCA GCGGAGATGA T CAT AT C GAT GTAACCAATC CAATGAAGGA TTTGGCTAAG 3 7 80 

GGTTACATGT ATGGATTTGT TTCTACAGAT AAGCTTGCTG CTGGTGTTTG GAGTAACTCT 384 0 

CAAAACAGCT ATGGTGGTGG TTCGAATGAC TGGACTCGTT TGACAGCTTA TAAAGAAACA 3 900 

GTCGGAAATG CCAACTATGT AGGAATCCAC AGCTCTGAAT GGCAATGGGA AAAAGCTTAT 3 960 

AAGGGCATTG TTTTCCCAGA AT AC AC G AAG GAACTTCCAA GTGCTAAGGT TGTTATCACT 4 020 

GAAGATGCCA AT GC AG AC AA GAACGTTGAT TGGCAAGATG GTGCCATTGC TTATCGTAGC 4 08 0 

ATTATGAACA ATCCTCAAGG TTGGGAAAAA GTTAAGGATA TCACAGCTTA CCGTATCGCG 4140 

ATGAACTTTG GTTCTCAAGC ACAAAACCCA TTCCTTATGA CCTTGGATGG TATCAAGAAA 4200 

ATCAATCTCC ATACAGATGG TCTTGGGCAA GGTGTTCTCC TTAAAGGATA TGGTAGCGAA 42 60 

GGCCATGACT CTGGTCACTT GAACTATGCT GATATTGGTA AGCGTATCGG TGGTGTCGAA 4 320 

GACTTCAAGA CCCTAATTGA GAAGGCTAAG AAATATGGAG CTCATCTAGG TATCCACGTT 43 80 

AACGCTTCAG AAACTTATCC TGAGTCTAAA TACTTCAATG AAAAAATTCT CCGTAAGAAT 4440 

CCAGATGGAA GCTATAGCTA TGGTTGGAAC TGGCTAGATC AAGGT AT C AA CATTGATGCT 4500 

GCCTATGACC TAGCTCATGG TCGTTTGGCA CGTTGGGAAG ATTTGAAGAA AAAACTTGGT 4560 

GACGGTCTCG ACTTTATCTA TGTGGACGTT TGGGGTAATG GTCAATCAGG TGATAACGGT 4 620 

GCCTGGGCTA CCCACGTTCT TGCTAAAGAA ATTAACAAAC AAGGCTGGCG CTTTGCGATC 4 680 

GAGTGGGGCC ATGGTGGTGA GTACGACTCT ACCTTCCATC ACTGGGCAGC TGACTTGACC 4740 

TACGGTGGCT ACACCAATAA AGGTATCAAC AGTGCCATCA CCCGCTTTAT CCGTAACCAC 4 8 00 

CAAAAAGATG CTTGGGTAGG GGACTACAGA AGTTATGGTG GTGCAGCCAA CTATCCACTG 48 60 

CTAGGTGGCT ACAGCATGAA AGAC TTTGAA GGCTGGCAGG GAAGAAGTGA CTACAATGGC 4 920 

TATGTAACCA ACTTATTTGC CCATGACGTC ATGACTAAGT ACTTCCAACA CTTCACTGTA 4 9 80 

AGTAAATGGG AAAATGGTAC ACCGGTGACT ATGACCGATA ACGGTAGCAC CTATAAATGG 5040 

ACTCCAGAAA TGCGAGTGGA ATTGGTAGAT GCTGACAATA ATAAAGTAGT TGTAACTCGT 5100 

AAGTCAAATG ATGTCAATAG TCCACAATAT CGCGAACGTA CAGTAACGCT CAACGGACGT 5160 

GTCATCCAAG ATGGTTCAGC TTACTTGACT CCTTGGAACT GGGATGCAAA TGGTAAGAAA 5220 

CTTTCTACTG ATAAGGAAAA G ATGT AC T AC TTCAATACGC AGGCCGGTGC AACAACTTGG 52 80 
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ACCCTTCCAA GCGATTGGGC AAAGAGCAAG GTTTACCTTT ACAAGCTAAC TGACCAAGGT 5340 

AAGACAGAAG AGCAAGAACT AACTGTAAAA GATGGTAAAA TTACCCTAGA TCTTCTAGCA 5400 

AAT C AAC CAT ACGTTCTCTA TCGTTCGAAA CAAACTAATC CTGAAATGTC ATGGAGTGAA 5460 

GGCATGCACA TCTATGACCA AGGATTTAAT AGCGGTACCT TGAAACATTG G AC C ATT T C A 552 0 

GGCGATGCTT CTAAGGCAGA AATTGTCAAG TCTCAAGGGG CAAACGATAT GCTTCGTATT 5580 

CAAGGAAACA AAGAAAAAGT TAGTCTCACT CAGAAATTAA CTGGCTTGAA ACCAAATACC 564 0 

AAGTATGCCG TTTATGTTGG TGTAGATAAC CGTAGTAATG CCAAGGCAAG TATCACTGTG 57 00 

AAT ACT GGTG AAAAAGAAGT GAC TACT TAT ACCAATAAGT CTCTCGCGCT CAACTATGTT 5760 

AAGGCCTACG CCCACAATAC ACGTCGTGAC AAT GC T AC AG TTGACGATAC AAGTTACTTC 5820 

CAAAACATGT ACGCCTTCTT TACAACTGGA GCGGACGTCT CAAATGTTAC TCTGACATTG 5 880 

AGTCGTGAAG CTGGTGATCA AGCAACTTAC TTTGATGAAA TTCGTACCTT TGAAAACAAT 594 0 

TCAAGCATGT ACGGAGACAA GCATGATACA GGTAAAGGCA CCTTCAAGCA AGACTTTGAA 60 00 

AATGTTGCTC AGGGTATCTT CCCATTTGTA GTGGGTGGTG TCGAAGGTGT TGAAGATAAC 60 60 

CGCACTCACT TGTCTGAAAA ACACAATCCA TATACACAAC GTGGTTGGAA TGGTAAGAAA 6120 

GTCGATGATG TTATCGAAGG AAATTGGTCA CTCAAGACAA ATGGACTAGT GAGCCGTCGT 6180 

AACTTGGTTT ACCAAACCAT CCCACAAAAC TTCCGTTTTG AAGCAGGTAA GACCTACCGT 62 40 

GTAACCTTTG AATACGAAGC AGGATCAGAC AATACCTATG CTTTTGTAGT CGGTAAGGGA 6300 

GAATTCCAGT CAGGTCGTCG TGGTACTCAA GCAAGCAACT TGGAAATGCA TGAATTGCCA 63 60 

AATACTTGGA CAGATTCTAA GAAAGCCAAG AAGGCAACCT TCCTTGTGAC AGGTGCAGAA 64 20 

ACAGGCGATA CTTGGGTAGG TATCTACTCA ACTGGAAATG CAAGTAATAC TCGTGGTGAT 6480 

TCTGGTGGAA ATGCCAACTT CCGTGGTTAT AACGACTTCA TGATGGATAA TCTTCAAATC 654 0 

GAAGAAATTA CCCTAACAGG TAAGATGTTG ACAGAAAATG CTCTGAAGAA CTACTTGCCA 6 600 

ACGGTTGCCA T G ACT AACT A CACCAAAGAG TCTATGGATG CTTTGAAAGA GGCGGTCTTT 6660 

AACCTCAGTC AGG CCGATG A TGATATCAGT GTGGAAGAAG CGCGTGCAGA GATTGCCAAG 6720 

ATTGAAGCTT TGAAGAATGC TTTGGTTCAG AAGAAGACGG CTTTGGTAGC AGATGACTTT 67 80 

GCAAGTCTTA CAGCTCCTGC TCAGGCTCAA GAAGGTCTTG CAAATGCCTT TGATGGCAAT 6840 

GTGTCTAGTC TATGGCATAC ATCTTGGAAT GGTGGAGATG TAGGCAAGCC TGCAACTATG 6 9 00 

GTCTTGAAAG AAC C AAC TG A AATCACAGGA CTTCGCTATG TTCCGCGTGG AT C AGGTTCA 6960 

AATGGTAACT TGCGAGATGT GAAACTTGTT GTGACAGATG AGTCTGGCAA GGAGCATACC 702 0 



WO 98/18931 



PCT7US97/19588 



976 

TTTACTGCAA CTGATTGGCC AAATAACAAC AAACCAAAAG AT ATT G AC T T TGGTAAGACA 7 080 

ATCAAGGCTA AGAAAATTGT CCTTACTGGT ACCAAGACAT ACGGAGATGG TGGAGATAAA 7140 

TACCAATCTG CAGCGGAACT TATCTTTACT CGTCCACAGG TAGCAGAAAC ACCTCTTGAC 72 00 

TTGTCAGGCT ATGAAGCAGC TTTGGTTAAG GCTCAGAAAT TAACAGACAA AGACAATCAA 72 60 

GAGGAAGTAG CTAGCGTTCA GGCAAGCATG AAATATGCGA CGGATAACCA TCTCTTGACG 73 2 0 

GAAAGAATGG TGGAATACTT TGCAGATTAT CTCAACCAAT TAAAAGATTC TGCTACGAAA 7 3 80 

CCAGATGCTC CAACTGTAGA GAAACCTGAG TTTAAACTTA GATCTTTAGC TTCCGAGCAA 744 0 

GGTAAGACGC C AG AT TAT AA GCAAGAAATA GCTAGACCAG AAACACCTGA ACAAATCTTG 7 500 

CCAGCAACAG GTGAGAGTCA AT CT G AC AC A GCCCTCATCC TAGCAAGTGT TAGTCTAGCC 7 560 

CTATCTGCTC TCTTTGTAGT AAAAACGAAG AAAGACTAGT ATTTAGTAAA ACCTCTTAAC 7 62 0 

AAGATTACGG AAGCAGTCTC TATCTTTTCC AATGAGGTTT ATAGTACAGA AAAAGCCTGA 7 680 

GAAGATGTCT TCTCAGGCTT TTGTTAAGCA CATAAATACA ATAGTGCTAT G AC AAAAT C A 774 0 

CCCAGAAAAA TCTGGGTGAT AAATGTTATG GTTGTGCTGG TTGAGGATTC TGATTTTGTT 7 800 

GATCAGGGGT TGTATTTGAT TGTTGCGTAT TATTGTTAGG ATTGGTAGTC GTACTATTAT 7 860 

TTGTGCTTGG AGTGGTTGAG CTAGACTGTG AAGTTGAACT ATCTGATGAT GAGCTTGAAC 7 920 

TTTCAGTTGA TGGGGGTTGT TGTGGAGCAG GTGAGTTCCA CGTAGAACGA GCACCATTTT 7980 

TAAATACGAA TTCTCCATTT CTGTAGAGCC CCTCTGGTAT ATTCCAATCT TCTGGATTGC 8 040 

TTCCTTCAGA CAGGTAGGTC AT C AT AGAGC GGTAAACTTT GGCAGCGACC GTAAGGCCAT 8100 

TGCCTACAAG TGGTGTCAGA CGGTTAGAAT AGCCTGTCCA T AC AG C C AT T GAATATTTAC 8160 

GCGTATAGCC AGCAAATAGT TCATCAGGTG CTACAAATTG AGAGGTCTTG ATGTGGTTTT 82 20 

CAATTTCCTC GTCTGTATAG TTAGAGGTTC CTGTTTTACC AGCCTGAGGG AGCCAAGCAA 8280 

GATAGGCATT TCGTCCAGTT CCATAAGTCA AGACTGTTTT CATCATGTCG GTCATCATAT 83 4 0 

AGGCTGTCGT TTCCTTCATG GCACGAGTTC CGACATTAGA GAACTCTTTT TCACTCCCAT 8400 

CACTAAAGAC GACTTTATGG ATATACATTG GTTTATAGTA AGTTCCACCA TTTGCAAAGG 8460 

CAGCGTAAGC AGCAGCCATC TT T TC ACT AC TTGCTCCATA TTTTTTGTCT GATTCGGTTG 8 52 0 

TGTTACTTGA AATGGCATTT GAGTAGTGAA TACTTGGGTA GTCGATTCCT AGACCATTTA 858 0 

GGAAAGTCTT GGCGCGGTTG AGTCCGACCT TGTTTAGAGT TTCCACGGCT GGGACGTTTC 8 640 

GCGATTGTTG CAGGGCGTAT TGCAAGGTGA TGTTGCCAAA GTAGCCCCTA TCCCAGTTAT 87 00 

AAACAGGAGT ATTTGTCCCA GGGTAGTTAT AGGGCTCATC GTGAACGATA GTAGCAGTTG 87 60 

AATCGTAGAC ACCGTACTCC AAGGCAGGAG CATAGTCTGT GATCGGTTTC ATAGTTGATC 882 0 
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CCCAGTCGCG GTTTGTTTCT ACTGCTTGGT TAATTCCGAA GGAAACATTA CTTGACTGAT 888 0 

GGCGTGCTCC TAGCTGGGCA ATGACTTTAC CGTTAGAAAC ATCAACAATG GTAGAAGCGA 894 0 

CTTGCAATTC ATCGTCTGGA TAGGCAACGT ATTCGTCTGT ATTGTAAATA TCC C AC AG AT 9000 

GTTTTTGAGC TTCTTGGTCT ACATTTGTGT AGACATCCAT CCCAGTTGTG AGTAGGTTAT 9 06 0 

AGCCTGTTTC TTCTTCAACT TGATTGATGA CTTCCTTGAG GTAATTATCC ATGTAAGCAG 9120 

GGTAATTACT TGCTGATTTG AGACTTTGTA GTCCATCAGT AATTGGTGTA TTGACTGCTT 918 0 

TCTCATACTG TTCAGCAGAG ATGTAGCCTT GATTTTTCAT TTCAGATAAG ACCAAGTTTC 9240 

GGCGGTCTTG GGCTGCTTCT GGATGTGAAT AGGGGTCATA TTGGTTTGGT GCCTGAGGCA 9300 

TTCCAGCCAG CAAGGCTAAC TGAGGTAAAC TTAAATTATT GAGGTCTTTA CCATAGTAGT 93 60 

TTTGAGCTGC TGTCTGCATT CCATAGTTCC CATTAGACAT GTAGACCTTA TTTATATAGT 942 0 

AGGTCAAGAT TTCTTGCTTG GTTGCTTTTT GTTCTAACTG AATCGCTAAC CAAGCTTCCT 948 0 

GAGCCTTACG AGAAATAGTC TGGTCGGAAG TCGAAGTTGA AAAGTAAGTC AACTTAATCA 9 540 

ACTGTTGGGT GAGAGTTGAT CCACCTTGGA GGGAATTGCT TTGCAGATTG CGCAAGAAAG 9 600 

CTCCCAGGAT ACGGATGGTA TCAATCCCCC TGTGGTCGAA GAAGCGATGG TCTTCGATAG 9 660 

AAACGATTGC CTTAACCAAA TCTGTGGGAA TATCATTAGC TTGGGCATTG ACGCGGCGTT 972 0 

CAGAACCCAA GTCAGCAATG AGTTGATTTT TATTGTCGTA GATTTTACTA GAAGTTGTTG 97 80 

CAACTAGTTT ACTCTCGGAT AGGCTAGGAG CCTTGCTAAC GTAGTAGAAA AAAACTCCTC 9 840 

CGCCTAAGAC AATGGCTGCG ATAACCAAGC TTAAGAAGCT AATGCTCAGA TACTTGATTA 9 900 

GGCGCAGAAT CGTTGGTTTG TTCATCTTGT T TT AC C AC C T AATAAATGTT CTTTGATAAC 9960 

ATTGAGATAA GGAATTTGAG GGAAGGCACC AGCCTTGATT TCATATCCAT ATTCTCGAAT 10 020 

ATATTCAAGT GGCATTGATT TTTGTCCCTT ATCTTGATGA TAGAAGCGAA TCAAATCGAA 10080 

TGCCGGCAAT AAGTAGGTTT CTTGCTGAGA AGAAAAGTGA AGAAGGACAA AGCAGATTCC 1014 0 

TTGTTGGGCA AGGACTTGTT CCATATGCTG AATCTGATGT GGATGAAAAT TTTTCATCGG 10200 

AATCGCACGT TTTTGTTTTG TTTCCTTGAC TTCAAAGTCG ATGTAATATC CATTATAAAC 102 60 

GCCAGAATAG TCCGTCGTTG AAGCTTGTCG AAAATAGGCT TCAACAATCT TGGCACGACT 10320 

TCGTTGTGGA TAGTCCACTT GTACGATTTG AATAGGAGTT GGTTTCTTAT GTATAACAGC 10380 

CAAGCCCTGA GACAAATAGT AGTCGTTGGT AGCATTGATC ATCTTTTCAA AGGGTACCGA 10440 

GCTCGAATTC GTAATCATGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT 10500 

TCCACACAAC AT ACGAGC CG G AAG CAT AAA GTGTAAAGCC TGGGGTGCCT AATGAGTGAG 105 60 
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CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG 1062 0 

CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC 10680 
TTCCGCTTCC TCGCTCACTG ACTCGCTGCG C 10711 
(2) INFORMATION FOR SEQ ID NO: 14 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11887 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 6: 

TACATTCATT CCATCGGCTA CTCCATAATA CTTAGATAAA ACCATAGCTG AAGTCGAATA 60 

CGGATACTGT AAAGTATTAT CAATTTTAAT CAAATCATCA TTACCGATAA TACTTCTGAT 120 

TGCTTTTGGT AGTATGAACC ATACGTTGGT GAAATCTCAG ATAATGAAGA ATC AT T AG AC 180 

TCTGGACCTT TTTCTAGTGT CTCACTTACC TCATATTCTT CACCCTTACT AGAAATAACA 24 0 

CTCAAAGCAG ATACTGTCGA TAACTGGCTA GCCAATAAAG TACTCGCAAT AATTGAAATA 3 00 

CCCAATTTTT TATAAACAGT TTTCTTCATT ATTGTATCCT CCTAATGTAA TTATAGCGTA 3 60 

CTATTCTAAA TTTCTTAATC T AC T AT AG AA TC AAGAAAT C TACCACCTTC TTTAAATACC 420 

C T C C ATT AT C ACATAAACAG GTAAACTTTT CAATTAATGA CTGCGCTTTT CAATCACGCT 4 80 

AGAGGTACTT GCTTGCTTCT TTGATACTAA GTTCAGCCAT TCTTTCCTTG TTTTTCTCAA 54 0 

TAAAGCATGT TACCCAAGTG GGATTCGTTT TGGAGTAGTC TCGCAGAGTC CAGCCAATGG 600 

CTTT AT T GAT AAAAAATTCT GTTTGGTTCA AGTTATGAAG GAGAATCTTT TCCATTAATT 6 60 

GAGTATTGGT CTTCTCTTTT CTTAACAACT GGTGGTCAAT AG CG AC ACGT CTCAGCCAGA 72 0 

T ATT AT CTG A TAGGCTCCAT TTTATACTCA ATGAAAATCA AAGAGCAAAC TAGGAAGCTA 7 80 

GCCGCAGTTG CTCAAAACAC TGTTTTGAGG TTGCAGATAG AGCTGACGTG GTTTGAAGAG 84 0 

ATTTTCGAAG AGTATTAAGA TTATTTCTTC TAGTTCAGGG TG T TC AT AC A CCAAACTCCC 900 

TACT AC T CG A TCTAGGATAT CTACCGTGTC CCACAAGGAT TTTGTCACGA CTAACTGCTC 960 

TAGCTTAGGC AAATCGGTTT CCTTTAGATA AGACTGCATT GCTTTCAAAT AGTTAGCAGC 1020 

CACATATTGG TATTTTCTAG GATCCTTTTC CCAGCAAGTG TCTGCAAAAT CCCAATCGAT 1080 

AATCTTTGTT TTTTTCGCTT C T GGAAAAT A TTTTATAGAG TTTATTTCTT TCAGGCACCG 1140 

CAATACCTAG AAAAGAAAAT TGATGGCGCA TATAGGCTTC CATGGACCTT GCTTTTTTAG 1200 

AGTCTTTTGC TGCTTCTAGC TCCTCAAGTA AATCTGCTAA ACT C AT CT AA AACTCCTCTT 12 6 0 
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GCCCCACCAA ATGGTGCTGA AAGGCATAGA CAGCCGCCTG GGTACGATCG CTGACTTCAA 13 2 0 

GTTTGGCAAG AATATTGGAC ACGTGGGTCT TGACCGTCTT GAGAGAGATA AAGAGGTCAT 13 8 0 

CTGCGATGCG CTGATTTTCG TAGCCCTTGG CGATGAGTTG GAGAACATCT CGCTCACGCG 1440 

CAGTCAATTC TTCATGAAGT TCCATATGAT TGCGGTGGTA TTCAACCTTC TTGCTAACCT 1500 

CTTGCTCAAT GGCCAGCTCG CCAGCAGCTA CCTTACTGAC GGCATGAAGC AATTCATCTG 15 60 

CACTAGAAGT CTTGAGCATA TAGCCTTTGG CACCAGCATC TAAGACTGGC ATGATTTTTT 162 0 

CATTGTCCAA ATAAGAGGTC ACAATCAAAA TCTTGGCTTC AGGCCATTCT TTAAGGATTG 16 80 

CTAAGGTCGC GTCAATCCCA TTCATCTCAG GCATGACAAT AT C C AT G AC A ATGACATCTG 174 0 

GACGCAGTTC CAAGGCCAAG TCAATCCCTT GAGACCCGTT GGACGCCTCA CCCACAACTT 1800 

CTACATCGTC TTGGAGGTCA AAGTAGCTTT TCAAGCCCAA T C GG AC C ATT TCATGGTCAT 18 60 

CTACTAGTAA AATTTTCATC TTTACTCCTT TATCATTCCT TATCTAACAG GGGAATACGG 192 0 

AT AT C AAC C G CCAGCCCTTG CTTGGGAGCT GTCAAGAGTT GAACTGTTCC AGCCATATCT 19 8 0 

TCAACCCGCT CCTTGATATT TCGCAGTCCA TAACTCAAGT CGTCTAAGCT CCCTAACTGG 2 04 0 

AAACCAATCC CATTGTCCAC CACCTTCAGT TGCAATTCAA CATCTGTCTG ATAGAGGTAG 2100 

ACATCTAGGC AAGATGCCTG GGCATGGCGG AGGGTATTGC TAATCAACTC TTGCAGGATA 2160 

CGGAAGATAT GCTCCTCGAT TTTCTTAGGC AATTTCGTCA TATTCTGCTT GAGACTAACC 222 0 

CTAAGATCAC TCTTGTCCTC AAGCTCTTTT AAAAGAATTT GAATCCCTTC TATCAAGCTC 2280 

TTCTGCTCCA GTTCAACTGG TCGCAAATGC AAGAGCAAAA CCCGCAAATC CTTCTGGGCT 2 340 

GTTTCTAAAA TAGCTGTGAC ACTCTGCAAC TGGGTCTGCA TCTTTTCTCT ATCCAATTTC 2400 

AAAGCCTGCT G AC TG AT AC C CGATAAAATC ATGTGGGCCG CAAACAACTC CTGACTGACT 2 4 60 

GTATCGTGCA AATCCCGAGC AATTCGCTTC CGTTCCTTCT CGATGATTTC CTCTTCCTGA 2 52 0 

GCAAGGCTCT GATTTTCAGC TTTTTGAAGA GCCTCTGTCA AAAGGTTAAG TTTACCTGAT 2580 

AAGGACTTGA AACTGGCATC CAAATCTGGA TCTGCAACCT GAACCACTTC TTGCCCTGCT 2 64 0 

AATAAACGCT T G AGAT TAG C CTGCATTTTT CTTAGAGAAA GCTCTTCGAT CCCTCGCCAA 2700 

AACAGGGCTA AGAGACAGGT CATGGACATG CTGAAAACCA ACAATAAAAA GACAAATTTT 2760 

TCTGTTTTTT CGACATCGTG CAAAAAGATA GACCAGTCAA AATCAAGTAT TTCCAGCAAG 2 82 0 

CTGTGGGAGA AAAAAAAGAC AAATAGGAAG G AGGT GAG AG CAATAATGAC ATAGGCTTGT 2 8 80 

TTTTTCATCC TCTAACCACC TCCACATCAC CAATCATAGT GGT C AAG AAA ATCTTGACAC 2940 

TCTTGTTACT C T T G AGAT AG TCTTTTGTTT CTTGATGATA GTGTTCATTG CGGAGGGCTC 300 0 
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GCTTGGGCTG GTTGAAAAAA ATCAAATCCC CATAGAGACA GTTAACGCTG AGACTGACTT 3060 

CCACATCTAC AGGTACGATG ATTTTGGTCG TTCCTACCAT CTTTCTGAGG ATAATGACAT 3120 

TGTCATGATT GGTTAAGATG ACCCTCTCCA GATGAATAGT GTCCTTGCCC ATGAAGCGAA 3180 

AGAGATTGAT ATCATCGAAT TGGCAAGTCT GGTAGCTTGA AAAATGATGA AGATTTCCAA 32 40 

ACCAACGATT TTTCTCCTTC TTAACCGTCA CGACCTCTTC AAAAACCAAA TTGGTCTGCT 3 300 

CTTTTTCCTG G T T CATC AT C GGGTAAAGAA GAAAGAGGCT AT AG AT AAC C GCAACAAAAA 33 60 

TAGCTAGAAT CACAAAAGGA TTGAGCATAA CGATGAAAAA GAAGAGAATG GTTGCCGCTA 3 420 

CTAAAAGAAG ATTATTTCCC TCTTTACCAG TGTAGTAGCG AATCAAAAGC AAAAAGAGGA 3480 

AT AGT AT C AG CAGAAAACGC GAAAAATGCT CTGATACCAT CAAAATCAGA GCTCCTGTCA 3 54 0 

GAAGACAGGC TTCGATAAAT AAAAAG ATT T TAAATTTTCT CATAGGTTCA TCCTCTCCCT 3 6 00 

TCTATTTTAT CACAATTCAA AAAAGTCACC TCAGTCTGAG GATGGAAAAA AGGCGCTGGT 3 660 

TACGCCTTTT TCATCTGATC CTTTGCTTCT TTTAATTTTC CATAAAGAAG ATAGTCTACT 3720 

TTTTGTAGAT CTGCTATGGT GGCACAGTTA AGGGAACACA TAATCAAGCG T AG AT CTGC T 3 7 80 

TTCCAGCCTT GGACAATGCC AATCACTTCT TCAACTGTGT AGGTTTCAAC CAATTCCAGA 3840 

ACGGTTCGTG ACAATCCCAC AGCCTTAGCA CCAAAAACCA AGCACTTAAT CATATCCAGC 39 00 

GGATTCCGAA CCCCTCCACT AACCAAGAGT TCGACCTTAT CTTTCCATTC TTGGGCATTG 3 9 60 

AGAAGGGCCT GCATGGTAGA CTGACCCCAT TGATTGAGGT AATCACGCTG GCCACTACGA 402 0 

CGGTTTTCGA TATAGGCAAA GCTGGTGCCA CCACGACCCG ATAGGTCCAC TGTACGAACA 4080 

CCGAATTCAT AGGCTCTTTC GATTGTCTTG GCATCCATTC CAAAGCCCAC TTCCTTGAGG 4140 

ACAATAGGAA CGGGAATTTG CTTGCTATAA TCTGCTAGAT GCGATTGCCA GCTTCTAAAC 4200 

TTCCTTTCTC CCTCGGGCAT GAGTAATTCC TG C AT G AC AT TGACATGCAC TTGCAATAGA 42 60 

AC AGG ATT C A TCTCTTCTAC AGTCTGAAGT CCTAACTCGA CAGGCTTGTC CAATCCAATA 4 320 

TTGGTTCCAA GGAGGAGATT GGGATGACTA G AC T TG AC AG AAAAAG AATC ATCCGTTGGA 4380 

TTTTTGAGGG CTGCGCTATA AGAACCCGTT ACAAATAAAA TACCACAGGA TTCCGCCACC 4440 

TGAGCCAGCT TTTGATTGAT TTCTCTTCCC TTATTACTTC CACCAGTCAT GGCATTGATA 45 00 

TAAAAAGGAA AGTCCCACTT TCGACCAGCA AACTCTGTCG AAAGAT CG AT TTCATCCAGA 45 60 

TTGTAAAGAG GCAAGGAAGA ATGAATCAGC TCCACCTCAT CAAAGCTATT ATAGGAACTT 462 0 

TTCTGCTCAA GGGCATAGAG GATATGCTCG TCCTTACGAT TTGTCGTCAT GTCCTATCCT 4 680 

TTCTTGATAT AAGAGCTCAA TCCCCAGATC GGCCCAACGA TTTTTTAAGG TTTTGGTTGA 4740 

TTGCGCATCA AAACTCAGGG CGATGCCACA GTCACCACCA CCAGCACCAC TACTCTTGGC 4 800 
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AACGGTCTGC AAATCTTGAC TGGCTTCTTT CAACTGTCTA AGCAAAGGCG TGTAAATATC 4 8 60 

TGTACTCAAG CCTTCTAAAA GCTTGCTGGC TACTTCTACT TGATCGATAA TCTTTTCTGA 4 92 0 

TTTCCCCTGT TCCAAGGCTT CTACCAGAGA AGTCACCGTT TCTTTTGAGG AAGTTAAAAA 4 980 

ATTTTGATTG ATATTTTGCT TGATTTGCTG GACCATGTGA CTCGATACAG CCACTTCCTT 504 0 

GGTCCATCCC ACTAAGAAAT CACATTCTAA AGTTGGTTTC ACTTGTGAAA TTGAAAAGCC 5100 

CCAATCACGC TCCAGAACTG TCGCCAAGTT TTCTTCTTCT AACCAAGCAG CCACCTTCTG 5160 

GCGATCAAAT GACTGGTAGA GAACCAAATC CTCTGCCACA ATACAGGCAA GGTCGCCCAT 522 0 

GGAACCATTG TCTCCTCGCT TAAGCAAGAC AGCGCTAGTC AGCTTGAACA AGAGCTCCTG 52 80 

ATCAACAGAA ACATCATACA GAGCCAGTAA AGCCTTGACA ACCAAGACAA CGACGCTGCC 53 40 

ACTAGAACCT AGACCAAACT TTTTCCCTTC TCGTTCCATT TTGCCACAGA T T T C TAG AG A 54 00 

AAAAGGTCTT AAATTCTGAC CACGAACAGC GAGGAAGTCT CCCATCAAAG CAATCGTTTC 54 60 

TTGAATCAAG CTATAGTCAG GATTAGGCCT TAAGTCCACT GCGAAATCAA ACATATCTGA 5 520 

ATAGATACGG TAG C T GT C AG AAAAAGCAAT CTCAGCCCTC ATATAGATGG GAATATCCTT 55 8 0 

TATCAAAGCT AACTGCCCTG GCTCTAAAAT AGCATATTCA CCTGCCCAAT AGAGTTTTCC 5 64 0 

GCAAGTTTTA AC AG C AAT C A TCTTG ACT C A AATCCTTTGT TTTTGACACA ATCAAGCGAT 57 00 

AACGATGACC GAAAATTTCT GATAAATGCT CCAAGTCTTT CTCCTGACAG AAGACCTTAA 57 60 

CATTGGGACC AG CAT C C ATG GT AAAGT AG C AGGCCTCTCC TTTCTCACGA AGCTGGCGAA 582 0 

CAAAGGCCAT AGCCTCATAA GAGGCATCCG TCAGATAAGA AAAGGCTGGA CTAGCAGTCT 58 80 

TTGTCGTAGC ATG CAT AG C C AGGGCATTTT TCTCCGTTAA TTCTCCAATC TTGGCAAAAT 5 94 0 

CATTTTCCTT GAGATAAATC AGCATATCCT GATAGTCCTT CTCAGACTGA CGAACCCAGT 6000 

CGTCGAAAGT CGTCGAGGTT T C C AC AC AAA GTTTCATCCC GTCACGGCTA G AG ATTGGT T 60 6 0 

TTTTCTTGTC CTCTAGCACC AACATAATCA TAGCTAGTTT CAAGTCTGTC TCTACAGGGT 612 0 

AAATTTCTCC ACTATCCTTA TCCCAGGCTC CTAGTGGTCC AT AAAAAC T C CGAGAAGAAG 6180 

AACCTGAGGC AAATTTGGCT TCCTGTGCCA ACTGACTTCT ATCCAATCCA AGCTTGAAAT 62 4 0 

AAGC AT T AC A AGCCTTGACC AGGGCGGACA AACCACTAGA ACTTGAGGAC AGACCCGCTG 63 0 0 

CCGTAGGCAT ATTGTTTTGA GTATCGATAC GGACAAAGCC CTCACCAGCT GGACGATAAC 63 60 

GGTCAATAAT CTTACTCATC TTGGCATGCT CGACCTCATT TTGTAGCTGA CCATTGATGT 642 0 

AAAATTCGTC AGCTGTTACA TTGGCTGGTA AAGGCGACAA GGTCGTCTCT GTATACATAT 64 8 0 

TTTCCAAAGT TAGAGAAATA CTGCTAGTAG CAGGCACCAT CTCTTTTTCT TTTTTCTTTC 654 0 
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CCCAATATTT GATAATAGCA ATATTTGCGT AGGAACGTAC TGTTACAGGC TCTCTATCCA 6600 

TGTCTGAACA GCTCCTTTCT CTTCTAATCT TTCTGCTAGT TCTTGTGCGT GTGTCAAATT 6660 

GGTTACCAAG GCTATGATAC AACCTCCTAG CCCACCACCG CTCATCTTGG CACCCAGAGC 6720 

ACCATGGCTA AGAGTCGTTT CAACCAAAAA GTCTGCCTCA GGGCTACTGA CTCCAATTTC 67 80 

TTTTAAATGT AAATGCGCTT GACTGAGGAT TTGTCCCAGT CCTTCAGCAT CTTTTTGTGA 684 0 

AATCGCAACT TCTGCTTGCT GGGTTAATTC TCCCAAGGCA TGCAAAAACG GTAGGGCATC 69 00 

CTTGCCCTTA TTTTGAACCA CTTGGATGGC TTCACGAGTA T G AC CAT AAA CACCCGTATC 6 960 

GGCAATCACC AAATAGGCGG ATAAATCCAT CTCAAGTTCT GTAAATCCTA CGTTCTTGAT 7 02 0 

AAAGCGAATA GGTTGGTCAC TAAGACAGGT CTTAGCATCC AAAC C AC TAG GATTCATATG 7 08 0 

GGCAATCATT TCAGCTCGAT TGACCAAGAT TTCTAGTACA TCATGAGGCA GATCAGCCTG 714 0 

ATAGTAGTCA AATACTGCAC GAATGGCCGC TATGCTGATA GCCGCTGACG AACCCATCCC 72 00 

CCGTTTCTCA GGGATAGCCG AGTCAATCTC ACAACGAATG CAGGCTTCTG TGATATTCAA 72 6 0 

AT ACT C C AGT GAGGCATAAA CCGCCATGGA CAAGGTATCC TCCTCATAAA GGCGCCAAGG 73 2 0 

ACTCTCTGCA GGAACTACCT TACAGGTCAC CTCCACCTCC AAAAGAGGCA GGGAAATGGC 7 3 80 

AGGATAACCG TAAACGACCG CATGTTCCCC TATTAAAATT ATCTTACTAT GTGCCTGACC 744 0 

GACACCAACT TTTTTTGTCA TTTTTTCCTT TTACTAGACG AAAAAACGTC TTATTTTTCA 750 0 

TACAAGTATT AATTCTTTCC TATCTATTTT ATTATATTTT CACAAAAAAA GCGATTGTTT 7 5 60 

C C ATT C AC AA TCGCTTCTTT CATTATTGAA CCCATTCGCC ATTATAGTTG ACAGAATAGC 7 62 0 

CATCTACGGT CGTATTCACT GCCAAGGCAC CTGAGCGCTA TAAGCGTAGT ACCATCTGCC 7680 

ATTGACCTGG AACCAACCTG TCGTCATAGA ACGACGAAAG AAACTCCATA CCATTAAGTA 774 0 

AAGAGGAAAG TCGTGAGGGA GCATGCGCCA TTGACAACCT GTTTTAGTGA CGTACAAAGT 7800 

CTCATTAACA AGTACTCGTT TCGGCCATTT ATAGGTGCGG TGTTTGGAGA AATAGGGTTC 7 8 60 

AATCTTCGCC CATTCTTGAT CGTTTAAATC AGTATCATAT GCTTTGCGTA TCATAACTCT 7 920 

AGCTTAACAT TTTTTTGTGA AT AC AGGT T C TAAATAATCG ACCACGAAAA TTTCTTAAGT 7980 

GGAAAACGCC TTATGAAGTA TGCTACGGGA AAGTTATGCA CTTAATTTGA CAATTCAAGA 8040 

TGTAAAAATA TATACTATAG TAGATTGAAA CTAGAATAGT ACACCTCTAC TTCTAAAATA 8100 

TTGTTAGAAA TCGATTTGAC TGTCCTGATC GATTTATCCT GTTATTATCT CATTTTACTA 8160 

TAATATTTGA TAAGTTATCC TAAAAGTATT ATT ATGTTG T TGTGTTATAG ATTGATTGAA 82 2 0 

T CT AACT AAA GG AT C C T ATT CAATTACTAG AACTATCACA TACTCAAGGT CAGCTCACAG 82 80 

ATGAGCAACT ATTTTGGTTA CAATGTCTAC TAAATTTAAG TCAAACAAAT AATTTAGTCA 8340 
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AAATTAAAAA AATAGAGGAA CATAAATATG ATTACAAAAC AGAATGTAAT AGTGTTCTAC 8400 

AATTTTTACT AGATAAAACT GTAAATTCTG AAGGAAGGAT CACTTCTTCA ACAGAATTTG 84 60 

GAAATTTCGT AAGTAATTTA TCATTCCAAC ACGGAATAGC TGGACTACTG TTTCCTCTAA 852 0 

ATAAATTGTA CCCCCCAGAA CTGGATTCTA AAATACTCTC TATCATCAAG AAGGCAGTGA 8580 

CAATTAGAAC GACACACACA TATGAATATC AATACTCACT GCTATTTGGT GATGCAGGCT 8640 

ATCTATGGTT ACTCCTACAT TTATTTTCTA TCAGTAAAAA TCAATACTAT CTACAATTAG 8700 

CAAACGTCAC CGCTAAAAAA TTAATAGAGA ATTATGATAC TCTAGAGGAA AT AG ACT TT G 87 60 

CATTGGGAAA ATCTGGTGTC CT ATT AT CAT TAATAAAATA CTATCAATTT ACCAATGACA 8820 

ATACTCTTAA AATTTTCATC CACAATAGTA TAGGGGAAAT TTATCATTAT TTCCTACAAA 88 8 0 

GAG AT AC AG C CAAAGAAAGC ATTTTAGACT ATAGCTTTGC T CAT GG AT AT TGTGGAATTG 8940 

CATATGCTTT ATTTGCCTAT TCTAAAGTCT TAGAACCTTC TATGTTTTAT AATGATCTCC 9 000 

ATACATTCCA TACTGAATTA AAAAAATTAT TAGAAAAAGT TACTTCTAAT ACTGAAAATT 90 60 

TAGGAAATTT ACAACTTTCT TGGTGCAAAG GAATTTCCGG AATAATCTTA TATCTTTGTA 9120 

TGTACGATTG TGACGGAAAC AAAGATATTA TTAGTAAATA TCAAGAATTT GTTTTTAACC 9180 

AT C AT CT AAA AATGATGACA GGATATTGCC ACGGAATAAC TAGCTTACTA CAAACCACTG 9240 

TCTACAATCA AAACAAATTA CTGATGAAAA AAATCCAACA GGTAATTTTA GCATGTTCTG 9300 

AACGAGATGA TCACGGTTTA CTGATGTTTC AAGGAGATAG TGGTAAAGCA GATTTGTTTG 93 60 

ACTTCGGAAT AGGAAGCATG GGGTATATTG GTGTCTATTA AATAATAAAT TCCCATTCGA 9420 

TGTGCAGACA TAAGGAGAAA AGTATGAAAT TATTTTGGAC AAACAACATA TAT AG AC AG T 94 80 

TGCTGCTAAA CAGCTGTTTT TCATCATTCG GCGACAGTAT TTTCTACCTC GCCATTATCA 9540 

ATTATGTGGC TCAGTACAAT TTCGCTCCGC TAGCGATTTT ACTGATTTCC ATTTCAGAGA 9 600 

TGGTTCCCCT ACTATCGCAA CTCTTTCTCG GGATTCTAGG AGATTTTCAA GAAAATAGAG 9660 

TCAAACACGC ACTCTGGATT GCCAAAATCA AAATCCTGCT CTACGCTATT TTGACAGTAT 9720 

TTCTCGTCTT GTCGCCCTTT TCATTAGTTT CAGTCATTAT GAT TGT CATC ATCAACCTCA 97 80 

TCTCTGACAC CTTGAGCTAC CTGTCTGCCT ACATGATGAA CGCCCTCTAC ATCAGTGTAA 9 84 0 

TTAAGGACGA CCTGCATGAT GCCATGGGGT TCAGGCAGTC TCTGATGAGG GTTGTCCGTA 9900 

TTGTCGCCAA TCTGGCTGGC GCATTCCTTA TCAATGTTAT AAGTATTCAA ACTATTTCCC 9 9 60 

TTATCAACAC TCTGACTTTT GTCATTGCCT TTTTGGGCCT GT AT GT T ATT CGACATACCT 1002 0 

TGTATGAGGT TGAAAAAAGA ATTGAAATGT C AC AT AC AG C ACTGAGTTTT AAGAAATATT 10080 
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TTCAACATCT TAAACAGTCG CTGGCTGTGC TCCTGAGGTT AAAAGATACC GTCATACTAC 10140 

TGTTTCTGAC GACCAGTATG ATTGCCATCT TGGATGTGTC CCCTCGGCTG ATTGCCCTCC 10200 

GCTTCATCCA ACAGACACTA GCACAACTGA GCATTGGGCA ACTCCTCGCC CTGCTCTCCA 102 60 

TCATCATGTC TTGTGGAGCT ATCCTTGGCA AT AT G ACC AG CAGTAATCTA TTTAAAAATA 103 2 0 

TCCGTTTCAC GCACCTCTTG GTTTTCTGTG AGATTTCCCT ATTGACTCTA ATAACTAGTA 10380 

TCCTTTGTCA AGCCTATATC GTAATTTTCA TGACCAGTTT CATCAGTTCT ACGATTATCG 10440 

GCATTCTCAG CCCTCGCCTA CAAGCAGCTG TCTTTGCCCA TATCCCCAGT GACAAGATGG 105 0 0 

GGACGGTGGG CTCTGCTCTG AGCACAGTGG ACATTCTCGC CCCGTCCCTG CTCTCCCTAT 10560 

TAGCCCTATC CATAGCATCG GGCGTTTCGG TGCAGTTAGC ATTGATATTT TTGTATCTTA 10 620 

TTTTAATTGC TCTTATCTTT TGTCAATGGT TAGTCAAGTT CAACACTCAT AACTAACGAA 106 80 

AAAGCATGTG TAGATTTCAC ATGCTTTTAA TCTCCCCAAT CGTCAGGTCA AGTACAACAA 10740 

AGTCACTTCT TTGATTAAGC GAGTGTTCTA ATATAATTAT AAGCGCCCTG TCATTACCGA 10 80 0 

ACCCATTCGC CATTATAGTT GACAGAATAG CCATCTACGG TCGTATTCAC TGCCAAAGCA 10860 

CCTGAGCTAT AAGCATAGTA CCAGTTGCCA TTGACCTGGA ACCAACCTGT CTTCATGTCT 10920 

CCATTACCTG CATTTAGGTA GTACCAAGTT GAACCATCTT GATACCAACC AGTTG C CAT A 109 80 

GCTCCTGATG AAC GG AG AT A GTACCATTTG TTCCCAAGGT TTTGCCAACC TGTTTTCATA 1104 0 

TCGCCATTTG GGTGGTCTAA ATAATACCAA GTGGTACCTT C C T GAT AC C A GCCAGTGGCC 11100 

ATTGCTCCTG AGG AAC GG AG GTAGTACCAC TT ATT AC C T A GATATTGCCA ACCTGTTTGC 11160 

AT AAT AC C AG T T GTTGG AT C TAGGTAGTAC CAAGTCGAAT CATCGTTTAT CCACCCCGCA 112 20 

CGTCTTTCAC CACCAAGGTA GTTTTCTCCA TTAATTTCCG TCTTAGCTAG ATAATACCAG 112 80 

T T AG ACTGAT CATAAAGCCA ACCTGTCTCT AAAGAATGAT TTTGATTAAA GTAATAGTTC 113 4 0 

GTATAATAAC GCTTCTCTTC TTTATCTTCT GAATCTTCAC GTTTTTCCCC GTACTTTCTT 114 00 

CCAACACTGT CTTTAGTTTT AATCTCTAAT GTTTTCCAAC CAACAAACTC TTGT AG C ACT 114 6 0 

CCATTTTTAT CGAAGTAGTA CCACTCTGAC TTTGGAAAAC CTTCTAATCT GATACCATTT 1152 0 

GGGTAAGGAC CAATTGTACT ACCTTTAGAT GGAAACGGGA TATATTGCCA GCCGACAACC 11580 

ATCTCTCCAG AT AG AG AAT C AAAATAATAG TACTTACCAT CAATCACTCG CCAGTAGGTT 11640 

TCTTTGAGGT CCCCCTTTTT GTAGTAGGTT CTTCCGTTTT CTTGGACAAA CTGCCATCCT 117 00 

TCAGAATCAT CTGCAAATAC TGTACTGGTC CCTAGCAAAC CAAAGAAAAA TACTGTCAGT 117 60 

CCAACTTGCA TAGTTTTTTT CAAAATTTTC ATCTATATAC CCTCCAATAT TAAATCCACT 1182 0 

CACCAGATGA GGCGAAATTA TAAACTTTAC CATCGATAGT TTGGCTACCT GTAACCATTG 1188 0 



WO 98/18931 



PCT/US97/19588 



985 

CTCCAGG 11887 

(2) INFORMATION FOR SEQ ID NO : 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11340 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 7: 

CCGGTATGTT CTGGAATACT ACCAATCTAA GCTGGCTGTG CCCTACAGTT TTACAACCCT 60 

GTACGAATAC CTTAAGGAAT AT G AC CG ATT TTTCAGCTGG GTTTTGGAGT CTGGTATTTC 12 0 

AAACGCTGAT AAAATATCCG ATATTCCTTT ATCAGTTTTG GAAAATATGT CTAAGAAAGA 180 

CATGGAATCC TTTATCCTTT ATCTACGTGA ACGTCCCTTG CTGAATGCTA ATACAACAAA 2 40 

ACAAGGTGTT T C AC AG AC AA CTATCAATCG AACCTTATCA GCACTTTCTA GTCTTTACAA 300 

GTATCTAACC GAGGAGGTTG AAAACG AT C A GGGGGAACCT TATTTCTATC GTAATGTAAT 3 60 

GAAAAAAGTT TCCACCAAGA AAAAGAAAGA AACCCTTGCT G C C AG AG C TG AAAAT AT C AA 42 0 

GCAAAAACTC TTTCTAGGTG ATGAAACAGA AGGTTTTCTA ACTTATATCG AT C AAG AG C A 4 80 

CCCACAACAG CTTTCAAATC GAGCTCTCTC ATCATTCAAC AAAAATAAAG AACGAGATTT 54 0 

AGCCATTATT GCCCTTCTCT TGGCATCTGG TGTTCGCTTA TCTGAAGCTG TTAATCTAGA 600 

TCTAAGAGAT CTCAATCTAA AAATGATGGT TATTGATGTT ACTCGAAAAG GTTGCAAACG 6 60 

TGACTCAGTC AATGTCGCTG CTTTTGCTAA ACCTTATTTA G AG AATT AT C TGGCCATTCG 72 0 

GAATCAACGC TATAAAACGG AAAAAACAGA TACAGCCCTT TTTTTAACTC TCTACAGAGG 7 80 

TGTTCCTAAT CGTATCGATG CTTCTAGCGT TGAGAAAATG GTTGCTAAAT ACTCAGAGGA 84 0 

TTTTAAAGTG CGTGTAACAC CCCATAAACT GCGCCATACA CTAGCAACTA GGCTCTATGA 900 

TGCGACTAAA TCACAAGTTT TAGTCAGTCA CCAACTAGGA CATGCTAGCA CACAAGTCAC 9 60 

TGACCTCTAT ACCCATATTG TTAGTGATGA ACAAAAGAAT GCTCTGGATA GTTTATGATT 102 0 

TTACGTATTT TAAATTATGT AAATAAATAT CAAAAAAAGA AGTTGGCCAA CTTCTTTTTG 1080 

AT T TAT C C AA CTACCGCTTC AGCGATTTCT TCACGGCTAA TACCAGCGAA GTAGCGTGTG 1140 

ATATCAATGG TTTTTAGCGC CTTAAGAACA TCTTCGCGTT CGTATTTCAC CCCACGAAGG 12 00 

ACATCTTCTA CTGCAGCAAC GTCTTCAATA CCAAAGAAGT CACCATAAAT CTTGATGTCT 12 60 

TGGATTTTTG ATTCAGTAAC GTTAGCAAAG ACTTCAACCT T AC C ACT AGT GAATTTGATT 13 2 0 
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CCACGACGGA CGTTAAATTC AGGTGATTTA CCATAGTTCC AGTCCCAAGT TCCAAACTTA 13 8 0 

GTATCCTTGA TGCGATTGAT TTCGGCCAAT TCTTCTTCTG AAAAGACGTA TTCAGTCATC 1440 

TCTGGGTACT CTTTTTTCAT GTATTCCAAG AGTAAATCAC GGAATTTTTC GACTGTGATT 1500 

TTTTTTGGTA ATTCATTGAT AATATTGGTT ACACGGGCAC GGACGGATTT CACACCTTTT 1560 

GATTCAAATT TATCTTTTGA AACCTTAAGG GCATTTGCGA GGACTGACAA ATCAACGTCA 162 0 

AAGAGCAAGC AACCGTGGTG CATGATACGG CCGTTGATAT AGGCTTGGGC ATTGCCACAG 1680 

AACTTCTTAC CATCAATCTC AAGGTCATTA CGACCTGTGA ACTCAGCTTT AACCCCAAGT 174 0 

TGAGCCAGGG TATTGATAAC CGGAGTTGAG AAGCTCTTGA AGTCAAATGC CTTATTTTCA 1800 

TCTTCTTTGG AGATGATCGT GTAGTTGAGG TTATTTAAAT CGTGGTAAAC AGCTCCACCA 18 60 

CCACTAATAC GGCGAACTAC CTCAATACCA TTTTCGCGAA CATAATCACG GTTGATTTCT 192 0 

TCGATAGTGT TCTGGTGACG ACCAACAATG ATAGATGGCT TGTTAATCCA AAGTAGGAAG 1980 

ATTTGATCCT CAT C C AAAAG GTGTTTAAAG GCGTATTCTT CCAAGGCAAT AT T AAAAG C A 2 040 

GTGTCATTTG AATGAT T GAT AATGTATTTC ATGATATCCC TTTACTTTAT ATGATAGAAA 2100 

CTGGAAATAA CCTTCCAGTC TAATCTATCT TCGTTTTATT TTTTCTTAGG TGAATGGATG 2160 

GCCATTCCTA GAACATCTGC AAACGCTTCG TACATCACTT CAGAGTAAGT TGGGTGCCCG 222 0 

TGGATGGTCT TCAGCATTTC CTCAACAGTG ATTTCCATTT CGATGATGCT TGATGCTTCG 2 2 80 

TTTATTAATT CTGCGGCTGC AGGACCAATA ATGTGTACAC CAAGGATTTC TCCGTATTTC 2 340 

TTATCAGCGA TAACTTTTAC GAAACCTTGA GCTGCGTCAG ATGCAATAGC ACGACCGTTA 24 00 

GCAGCAAAGT TAAACTTACC GATGGCAACA TCGTATTTCT CACGGGCTTG TTCTTCTGTC 2 4 60 

AAACCTACTG CTGCTACTTC AGGGAGAGTG TAGATGGCTG CAGGAGTCAA ATTCAATTTG 2 52 0 

GCAACTGCAT GATTTCCTTT AAGGGCATTT TC AG CGG AAA CTTCACCCAT GCGGAAAGCT 2 580 

GCGTGAGCCA ACATCTTAGT ACCGTTGATG TCACCTGGTG CATAAATGCC TGGAACTGAA 2 64 0 

GTTTCCATGT ATTCGTTGAC CTTGATACAA CCACGATCCA ATTCAAACTC AACCTCTCCA 27 00 

ATACCTTCAA GGTCTGGCAT ACGACCAATT GAAAGAAGAG CTTTGCTTGC GATGATATCG 2760 

TCTTTTCCTT CAACCTTGAT ACGAAGTTGA CCATTTTCCT CAATGATTTC TTGCAGTTTA 2 82 0 

GTACCAGTCA AGATGGTCAT TCCTTTACGC TCAAGAATCA AGCGAAGGTT CTTAGAAACT 2 880 

TCCACATCCA TAGCTGGAAC T AT ACGGT C C ATCATTTCGA TAACAGTCAC TTTTGAACCA 2940 

AATGTCATGA AGGCCTGACC GAGTTCGATA CCGACAACTC CACCACCGAT GATAACAAGG 3 000 

CTTTCTGGCA CTTCGTTCAT TTCAAGAATG T CATC ACT AG TCATGACAAG TGGAGATTCC 3060 

AT AC C AGGG A CGTTGATCTT GTTGACTTTT GAACCACCAG CAAGAATGAT TTTCTTGGTT 312 0 
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TCAAGCAATT C AG AAC C ATT TACCAAGACG TTCTTGTCTT TAGTGATTGT ACCAATTCCT 3180 

TTATGAACAG TAACTCCGTA GCTACGAAGA AGTCCTGCAA CACCACCAAC AAGAGTATTA 3240 

ACAACTTTAG ATTTAGTTTC TAAAAGTTTT TCCATATCAA CAGTGAAGTT AGGATTTTCA 3 3 00 

AT C AC GAT AC CACGATTTGC AGCATGACCG ATATTTTCAA TAATTTCAGC GTTATGAAGG 33 60 

TAGGTCTTGG TTGGAATACA TCCACGGTTT AAGCAGGTTC CACCAAGTTC AG AT T T CTC A 3 420 

ACAAGGGCAA CCTTACCGCC GAATTGGGCA GCTTTAATGG C T GC AAC AT A ACCAGCAGGA 34 80 

CCTCCACCAA T C AC AACG AT ATCAAAAGCA TCATCGCTCT TACCATCATC GTTTGAGGTA 3 54 0 

CTTGCTACAG GTACAGGGCT AGCTTCTGGC GATGCTGCTC CAGCTGTTGG GATGTTTTCC 3 6 00 

CTTTCTTCAC CAAGGTAACC GATAACTTCC GTTACAGGGA CAGTTTCACC ATCTCCTTTG 3 6 60 

AGAATGGCAA TCAAGTACCC ATCTTCTTCG GCTTCCAATT CCATGCTGAC TTTATCAGTC 3 720 

ATGATTTCCA AAAGGATTTC TCCTTCTTTT ACAAATTCTC CGACTTTTTT ATTCCATTGG 3 7 80 

ACGATTTGTC CTTCTGTCAT ATCCACGCCG GCTTTTGGCA TAATTACTTC TAAGGCCATG 3 84 0 

TCTTCCTTCC TTTATCTATA TCTTAAAAAT GAATACTCTT GCTCTTAAAT TAACATTGAG 3 9 00 

ATTGGCGTTT CAATCAACTC TTTCAAGTCC TTCATAAACT TAGCACCAGC CAT AC CAT CT 3 9 60 

ACGACACGGT GGTCAATGGT TAATCCTAAA CTC AT GAT T G GGCGAATCAC AATTTCACCA 402 0 

TTGACGACAA CTGGCTTCTC GATTGTCGAA CTGACACCAA GGATAGCTGA GTTGGGTTGG 4080 

TTAATAATCG GACCAAAGGA CTGAACACCA AACATTCCCA AATTACTGAT TGTGAATGTT 4140 

GAATTTTGTA ACTCACTTGG AGCCAATTTA CCATCCAAGG TACGGCCAAT AACATCCTTA 4200 

AAGGCTACAA CCAGTTCTGA AAGACTCATC TTCTCAGCAT TGTAAACAAC AGGTGTCATC 42 60 

AATCCATTAT CCATCCCAAC TGCCATGGCA AGATTGACAT AGTTGTGAGT GATAATAGTC 4 320 

TTGCCATCTT CTGTCAATGA AGCGTTGATG TATGGGTGTT TCATAAGAGT CTTAACAACT 43 80 

GCAAGCGAAA GAAGGTCTGT TACAGTAGTC TTCTTCCCAG TTGCTTCCAT GATTGGCTCA 4440 

AG AAC CT T CT TACGAAGAGC CAACATTTCA GTCATATCAA CTTCATAGTT GAGGGTGAAG 4500 

GTTGGCGCAG TCAAGTAAGA TTCAACCATG CGTTGGGCAA TAACCTTACG CATTGGTGTC 45 60 

ATTGGAATAC GCTCGATTTT ACCATATGGT GTTACGTTAT CAGGGACTTC TTCCACTTTT 4 620 

T C AATCTG AG CAGGAGATTT GATGCTATCG TTTTCGATAT TTTCAGGAAG CAGGGCCAAA 468 0 

ACATCCTTCT TCATGATTTT ACCACGATGA CCGGTTCCTT GGATTTCCTG CCAAGCAATG 474 0 

TTATGTTCGA GGGCAATTCG TTTTGCAAGT GGCGAAATGC GAACCACGTT TGTGTCTTTA 4 800 

TAAGTTTCCA CGTCTTCTTT GTGGACACGA CCGTTTGCAC CTGAGCCAGA AACGTCGTAG 48 6 0 
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AGGTTTATCC 


CTAAATCATC 


CGCTAACTTT 


CTAGCTGCAG 


GAGTCGCTCT 


TAGCTTGTCA 


4920 


TCAGCCATGA 


CCTCTCCAAT 


TCTATTTATG 


ATACAAAGGG 


CGTCAAAAGC 


GACTGAAAAA 


4980 


TAGGAAATCG 


ACGATGGCTT 


CGATGAAGCC 


AAGG AG AT TT 


ATCTTTTTTC 


CGATCTTTTA 


5040 


GCCCGTGCTC 


TAATCTAAGA 


T AT T AATG AC 


GAAGAGCTCT 


GCACCTAAAA 


GATACAAAGT 


5100 


TTCTCGTCAG 


CTTTATTTTA 


TTTACATAAC 


TTATCTTATG 


TAACCCTATT 


CTTTGTTATA 


5160 


AGTTTTTCGG 


ATTGCATCTT 


TG AT AC T T T C 


AACTGTTGGA 


ATCATTGCAT 


TTTCTAGGTT 


5220 


TTGTGCATAA 


GGCATCGGCA 


CATCTTCTCC 


TGCACAACGG 


C G AATTGGTG 


CATCTAGATA 


5280 


GTCAAATGCT 


TCTGATTCTG 


AAATAATAGC 


TGAAATTTCA 


CCGATATAGC 


CACTTGTTTT 


5340 


GTGGGCATCG 


TTGACCAGAA 


CAACCTTACC 


AGTCTTCTTC 


ACTGAGTTTA 


TGATGATATC 


5400 


CTTATCAAGC 


GGAACAAGGG 


TACGTGGGTC 


AACAATTTCA 


ACTGAAATTC 


CTTCTTCTGC 


5460 


TAATTCTTCA 


GCAGCTTGAA 


CCACACGGCG 


AAGCATTTTT 


CCATAAGTAA 


C AAC TG TT AC 


5520 


ATCCGTTCCT 


TGGCGTTTGA 


TTTCACCAAC 


CCCAAGTGGA 


ATTGTGTAGT 


CTGGATCAAC 


5580 


TGGCACTTCC 


CCTTTTTGGT 


TAAATTCTGA 


CTTGTACTCA 


AGTATAATAA 


CTGGGTTGTT 


5640 


ATCACGGATA 


GAAGACTTAA 


GCAGGCCTTT 


CATGTCCGCA 


GGTGTTCCAG 


GTGCCACAAC 


5700 


CTTAAGTCCT 


GGAATGTGAG 


TAAACCAAGA 


C T C TAG AG AT 


TGTGAGTGCT 


GGGCGGCAGA 


5760 


GCCAACTCCG 


TTACCAGCTG 


CACAACGAAC 


AG T CAT T GG A 


ACCTGACCTT 


TACCACCAAA 


5820 


CATGTAACGT 


GTTTTAGCAG 


CTTGGTTGAC 


GATATTGTCC 


ATGGCAATAA 


CAGAGAAGTC 


5880 


CATGAAGGTC 


ATATCGACGA 


TTGGACGAAG 


TCCTGTCATG 


GCTGCTCCTG 


CTGCTGCTCC 


5940 


AG AG ATGG C A 


GCTTCAGAAA 


T C GG AC AG T C 


ACGGACACGT 


TCTGGACCAA 


ATTCTTCAAG 


6000 


CATTCCAACA 


GAAGTACCGA 


AGTCTCCTCC 


GAAGACACCG 


ACGTCTTCTC 


CCATCAAGAA 


6060 


CACATTTTCA 


TCGCGACGCA 


TTTCCTCAGA 


CATAGCAAGG 


ATAATGGTGT 


C AC GG AAGG A 


6120 


CATTGTTTTT 


GTTTCCATTT 


TATCTCTTTC 


TCCTTAGTCT 


GCGTAAATAT 


CTTCAAAGGC 


6180 


TGATTCAAGC 


GGTGGGAATG 


GGCTTTCCTC 


TGCAAATTTA 


ACAGAAGCTT 
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CTTTACTTGC 


GCTTGGATTT 


CTTCCAATTC 


TTCGGCACTT 


GCAATGTTAT 


TTTCAATAAG 


6300 


GTAATTGCGG 


AGGTTTTCGA 


TTGGATCTTT 


TTGTTTCCAC 


AATTCCACTT 


CTTCACGCGT 


6360 


ACG AT AT T T A 


CCAGGGTCAG 


ATGATGAGTG 


ACCGAGCCAG 


CGATAAGTTA 


CACTTTCAAT 


6420 


CAAGACTGGA 


CCATTGCCAC 


TGCGAACATG 


GTCCACAGCT 


TTCTGAAATC 


CTTCATAGAC 


6480 


ATCGATGACA 


TTGTTACCGT 


CTTCGATGAA 


CATTCCAGGA 


ATTCCATAAG 


CGGCGCTACG 


6540 


T TG ATGG AT A 


TGTTCTATAT 


TGGTCATTTT 


CTTGATATCC 


GCAGAGATAC 


CGTAACCGTT 


6600 


GTTAATGCAA 


TAGAAAATGA 


CTGGCAGGTT 


CCAGATAGAA 


GCCATGTTCA 


CTGCTTCGTG 


6660 
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GAAAACACCT TCATTGGTCG CACCATCTCC AAAGAAGCAG ACAACGATTT TACCGGTATT 672 0 

TTGCATTTGC TGACTGAGGG CTGCACCGAC AGCGATCCCC ATACCACCAC CTACGATACC 67 80 

ATTGGCACCA AGGTTCCCAG CATCAAGGTC AG CG AT AT G C ATAGATCCAC CTTTCCCTTT 6840 

ACAGGTTCCA GTGTATTTAC CAAGGATTTC AGCCATCATT CCGTTGAGGT CAATCCCTTT 6900 

AGCAATAGCT TGCCCGTGTC CACGGTGGTT TGAGGTAATC AGATCATCTG GATTGAGAGC 69 60 

TAACATAGCC CCCACGTTAG CTGCCTCTTC ACCAACAGAA AAGTGCGTCA TTCCTGGCAC 7 02 0 

TTTCCCTTTC TTTACTAATT GTGCAATTTT TAAGTCCATG CGACGGATTT CTTCCATCTT 7080 

ACGGAACATT TCTAGCAAAA GATTTTTATC TAAAGTTGAC ATCTTCTTGC CTTTCTAACT 7140 

TTCTTCTTAC CTT ACT AT T T TACCGCTTTT GGCAAATACT GTCAAAGTTT TTCTAAAAGA 72 00 

AATTTCACAA AATAAAAAAG AAAACCCCGT GAAAACAAGG GATTTTCTTG TCAAGAATAT 72 60 

TTTTTCACAA ACTTTTTAGC ATTTGGATTT TGCTAAAGAT TCAAATCTCT TCATAATCAC 7320 

AGTTAAACGC CAACGGTAGA GCGCCCCGCT CACAATCAAA CTAATAATCA AGCCGATCCA 73 80 

GTAAGAATAA GCTCCAAAAT CTGTTAGGGA ATCAAATAGC GTAnCACAGG GATTGCTACG 7440 

CCCCAATAAC CAAGCAAACC AAGGTAAAAA GGAATAACTG TATCCTTATA CCCCCGCAAA 7 500 

ATTCCCTGAA GCGGCGCCGC AAAGGTATCT GCTAACTGGA AGAAAAGACT ATAAGTTAAA 7 5 60 

AAACGCACTG TCAAATCGAT AAATTTTGGG TCGTTACCAT AAAGACTGGC CACATTTCCC 7 62 0 

CTAAAAATGT AAAGGAAGGT TAAGGTGAAG GCCGCAAAAA TGAGGGCAGT CCATCTTCCT 7 680 

AGACCAATAT AGGTTTTCGC ATCATCAAAT CGCTTGGCTC CCACTTCATA GGAAACGACA 774 0 

AT AGC C AT AG CCGATGAGAT ACTCATAGGA AAGGCGTACA TAAGACTTGA AAAGTT C AT A 7 8 00 

GCTGACTGGT GACTAGCTAT AATCAAGGGC GAAAACTTAG CCATAATCAA GCCAACCACT 7 8 60 

GAAAAGATAG CCACTTCCGC GAAGACAGTT CCCCCAATAG G C AG AC CT AA ACGAACTCCT 7920 

TCCTTAATTT TATCCATATT AAGTGGAATT CGTTTCTCAA GGTGTAAGGC TTTGAGCTTC 7980 

TCCTGTTTAA ATAAAACCAG AACAGAAATC CCAAGCAAGA CCCAGTAGGC CAAGGATGTT 804 0 

CCTAAACCAG CACCAGCCCC TCCCAGTTCT GGAACACCAA AGGCACCGTA AATCAAGAGA 8100 

TAGTTAAATC CGCTATTGAG AGGGAGTAAC AAAAGCATGA GGTACATGGA CAGTTTGGTC 8160 

AAGCCCAGCG AATCCAGCAA GGAACGAATG ACGCTAAAGA GCAACAAGGG GATAATCCCG 822 0 

ATAGATAAAA ACCAAAGATA GCGAACCGCT ACTGCCGCTA CTGCTGCTTC TAACCCAATA 82 80 

TGATTCAAGA TTATTGGTGC CAAGAAAAGT ACCATCCCCA GCAAGACCAC AGATAGGCCC 8340 

AAGGCCAAAT AAATAAATTG GTAAAAATCA GACGCAACTT CTTCCTTTTT GCCTCGACCA 84 00 
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AGATGGTGAC CAATGATAGG CACCAAGGCT GACACAATCC CTGTTAGAAA TGTAAAGAAA 84 60 

GGATTCCAGA TACTGGTTGC CATAGATACA CCAGCCAAGT C CAT AG T GT T GTATTGACCT 852 0 

GTCATTGCAG TATCAACAAA AG AG G C AG AA TAATTGGCAA ATTGGTAGAT CAGGATTGGG 858 0 

AAGAAAATTT TTAAAAATAA TACTAACTTC TCTCGTAAAC ACTTTGTCTT AT AC AT AC TT 8 64 0 

CTCTTTCTAT TCTGATTTAT CTAAACCAAA GAGTTTCAGA CCATAGTTTT TCAAACTTAG 8700 

CGGAGGTTTA TTAGATTTTG AAGTAGTATG CCAACACGCA CATGTACGAC AATAATAGCT 87 60 

TCTAACTAAA CCTCCGTTAT CATATTGAAC CGCATGGTCA GCTTTTTCTT TAGTTTCATA 8 82 0 

TTGAATTTTG GAACGATTAG CTGCGGGACA GTAAATTCCA C T ATT AG AT T TCGCTTGTCT 8880 

CTCCCTACGT TTTCGAAAAT AATTCATATT CTAACTCCTA TCAAGCTTGA TAGACGATTT 8 940 

GTCCCTTACA GATGGTATAT TTAACCTGCC CTTTTAAGGT TTCACCGATG AATGGTGAAT 9000 

TAGCTGCTTT GGAAGCAAAA TGGGAGTCCA CAAAGCGGTC AGCCTTGGCA T C AAAAAT AG 9060 

TGATATCTGC TGGACCATTC TCAGCCAAGT AACCTGCTTC AAAGTTGTAA AGCTTGGCTG 912 0 

GGTTGTATGT CATTTTTTCA AGTAATTCCA TCAAGCTCAA CTCACCAGCT TCTACTAAAT 918 0 

AGGTCAAGCT GAGAGACAGG GATGTTTCTA AGCCAGTCAT ACCAGATGGC GCTTTGGTAA 92 40 

TATCCTCAAC ATTTTTTTCA TCTACATGAT GAGGCGCGTG GTCAGTCGCA AT AACTGT G A 9 3 00 

TG AC AC CTG A TTTGAGACCT TCGATAACGG CACGACGGTC TGATTCCAAA CGAAGCGGTG 93 60 

GATTCATCTT AGCATTGCTA CCTTGTGTTA AAAGAAGTGC TTCTGTCTTA GAGAAATGCT 9420 

GTGGCGCTAC TTCTGCTGTG ACTTCTGCAC CTAACCCCTG AGCAAACTCC ACTACTTTAA 94 80 

CACTTTCTTC CTTAGACAAA TGCTGGATGT GAACATGGGC TTTAGTTGCA TAGGCAATCA 954 0 

TGACATCACG CG C CAT CAT A GCGT AC T C AG CCACCCCAGT AGCACCGCAG ATATGGAAAT 9600 

GTTCTCTAGC AATATTTTCA TTAAAGCCAA GAACACCGTT CAAACCTGGA TCTTCCTCAT 9 6 60 

GAAGGCTGAT AAAGGTATTG AGTTTTTTGG CTTCCTCCAT GGCTTCCTTG ACAATCTTAC 97 20 

TGCTCTCAAG CGGAATACCG TCATCAGAGA AACCAACCGC ACCAGCTTCT AAGAGTGCCT 97 8 0 

TAAAGTCAGT CAAGTTTTTA CCATTAAAGT TTTTAGTAAT GGTCGCAACT GTCTTGACAT 9840 

TAATCTTCTC TTTGGCAGCT GACTGGAGAA CTGCTTGCAA AGTCTCCACG TCTGAAATGG 99 00 

TTGGACTGGT ATTAGCCATC ATGACGACAG TAGTAAAACC ACCTGCAGCG GCTGCTAGGG 9960 

CACCAGTATG AATGTCTTCT TTATGTGTTT GACCAGGTTC AC GG AAATG A ACATGAATAT 1002 0 

CGACCAAGCC AGGAGCAACC ACAAGACCAG TAGCATCAAT CGTTTCTGCT CCTTCTTCCG 10080 

TG AT CT C AG A CGCAATTTTG ATAATTTTCC CATCTTGAAC TAAGACATCA CAAACTTGAT 1014 0 

CCAAACCAGA CTTGGGATCC AT T AC ACG AC CATTTTTGAT TAGTAGCATC TGCTTTCTCC 102 00 
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TTTATTCATA GAAATCAACT TGGGT AT C C A AC AATTT AT C CCCATCATAA ACAAACTTGG 102 60 

CTGAAAAGAA GGGTTTATCC TCTAAAAGCC ACT C AAC AAA GGTGTGGTCA CCTTCCCAAG 1032 0 

TCGGCTTGCT CAAAACCTCA TCATAGGGAA C C C AT T CT AG CGTCCCCTCA TTGCAGTCAA 10380 

TCAAGTCGCC CTCAAACTCC GTCACCTTAA AAACATAGGT GTACCAGTCT AAATCTGGTG 10440 

TAAATTCAGG AAAAGTGATG ACACCTTTTA GAACTGGCTT GGCTTTGAGC CCTGTTTCTT 10500 

CAAGGATTTC ACGCGCCGCG CATTCCTGGG GCGTCTCTCC TCTCTCTAGC TTACCACCCA 10560 

CACCAATCCA TTTCCCTTCA TGGACATCAT TGGGTTTCTT ATTACGATGG AGCATGAGCA 10620 

GTTCTTTCCC ATTATCAATG TAGCAAATCG TCGCTAACTG AGGCATATTT TCTCCTTATC 10680 

TAAGCCAATC GATTGGCTCT TGTCCTGTCT CTTTTAAGAA TGCATTGGCC TTGGAAAAGG 1074 0 

GCTTGGAACC CCAAAATCCT CTATAAACCG ACAAAGGACT TGGATGGGCT GATTCGATAA 10800 

TCAAGTGATG AGGATTGGTA ACTAATGCCT TCTTCTTACG TGCATAAGCT CCCCAGAGTA 10860 

CAAAAACGAC TGGTCTATCT AG AT GAT T G A CCACCTGAAT CACAGCATCA GTAAAAGGCT 1092 0 

CCCAGATTTG ACCAGCATGA CCATTGGCCT GTCCAGCAGG AACAGTCAAA CAAGCATTAA 10980 

GAAGCAAGAC TCCTTGCTCA GCCCAAGCTG TCAAATCATG AGATTTCTTA ACTCCGATAT 11040 

CAT C TG AC AA TTCTTTCAAG ATATTTTGCA AGGATGGTGG AGCTGGGATA GAGTCAGGTA 11100 

CAGAAAAACT CAAGCCCTGC GCTTGACCTG GTCCGTGATA GGGGTCTTGC CCTAGAATTA 11160 

CCACCTTAAC TTCTTCAAGC AGTGTTGTCA AGAGAGCCTG AAAAACCTTT TCCTTGGGTG 112 2 0 

GATAAATAAT CCCCTGAGAA TAGACCTGCT CCATAAACTG ATTGATTTTC C CGAAAT AAC 112 80 

CCTCAGGTAA TTGCGCCTTA AT C AAAG CAT GCCAAGACGA GTGTTCCATA GCCGACTCGG 1134 0 
(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

AAAAAATAGA CTTGTTAGAC TATAAATGTA GTAAGCCTAC ACAAGAAAAA TACATAGAGA 6 0 

TAAAGGTGAT TATTATGAAA TTCAAAAAAA TGCTTACTCT TGCAGCCATT GGCTTATCAG 120 

GATTTGGGCT TGTTGCCTGT GGCAATCAGT CAGCTGCTTC CAAACAGTCA GCTTCAGGAA 180 

CGATTGAGGT G ATTTC AC G A GAAAATGGCT CTGGGACACG GGGTGCCTTC ACAGAAATCA 2 40 
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CAGGGATTCT CAAAAAAGAC GGTGATAAAA AAATTGACAA CACTGCCAAA ACAGCTGTGA 300 
TTCAAAATAG TACAGAAGGT GTTCTCTCAG CAGTTCAAGG GAATGCTAAT GCTATCGGCT 3 60 

ACATCTCCTT GGGATCTTTA ACGAAATCTG TCAAGGCTTT AGAGATTGAT GGTGTCAAGG 42 0 

CTAGTCGAGA CACAGTTTTA GATGGTGAAT ACCCTCTTCA ACGTCCCTTC AACATTGTTT 4 80 
GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG ATTTTATCAG CTTTATCCAC TCCAAACAAG 54 0 

GTCAACAAGT GGTCACAGAT AATAAATTTA TTGAAGCTAA AACCGAAACC ACGGAATATA 60 0 

CAAGCCAACA CTTATCAGGC AAGTTGTCTG TTGTAGGTTC CACTTCAGTA TCTTCTTTAA 6 60 

TGGAAAAATT AGCAGAAGCT TATAAAAAAG AAAATCC AG A AGTTACGATT GAT ATT AC CT 72 0 

CTAATGGGTC TTCAGCAGGT ATTACCGCTG TTAAGGAGAA AACCGCTGAT ATTGGTATGG 780 

TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AGAGTCTCAC CCATGATGCT ATTGCTTTAG 84 0 

ACGGTATTGC TGTTGTGGTC AATAATGACA ATAAGGCAAG CCAAGTCAGT ATGGCTGAAC 900 

TTGCAGACGT TTTTAGTGGC AAATTAACCA CCTGGGACAA GATTAAATAA AATGTTTGCT 960 

CCATAAATCT CTAAAGAGAT GCAGACGTTT CATCGTACAA TAAGATAAAG AAGGCAAGTA 1020 

GGGAGGTGTC GTATCTCCCT TACTTTCTTC ACTAGAAAGG ACAAGATGTG ACAAAACAAG 10 8 0 

CCTTCAAAGA AGCAGTTTTT AGGGCAATTT TTTTCATGAG TGCAACAGTA GCTGTTGTAG 1140 

CTATTTTGCT AATCTGTTTC TTTATTTTTA GTAATGGCTT ACCTTTCATA GCTAACTACG 12 00 

GCTTTGCCCG TTTTTTATTA GGCAGTGATT GGTCGCCAAC GAACATTCCG GCAAGCTATG 12 60 

GTATTTTACC AATGATCGTT GGTTCCTTAT TAATTACCTT AGGAGCGATT GTGATTGGGG 132 0 

TGCCAACAGG CATCTTGACA TCGGTGTTTA TGGTTTATTA TTGTCCAAAG CCCGTCTATG 13 80 

GCTTCTTAAA ATCAGCTATC AACTTGATGG CAGCCATTCC ATCTATTGTT TATGGTTTTT 1440 

TCGGCCTACA ATTATTGGTG CCTTGGATTA GAAGCTTTTT AGGAAATGGC ATGAGTGTCC 1500 

TAACCGCTTC GTT AC TAT T A GGAATAATGA TTTTGCCAAC CATTATCAGT TTGTCAGAAT 1560 

CTGCTATCCG AACAGTTCCC AAAACGTATT ATTCTGGTAG CTTGGCTCTA GGAGCTAGTC 162 0 

ATGAACGGAG TATTTTTAGT GTCATCTTGC CAGCTGCGAG ATCTGGTATT TTATCAGCAG 16 80 

TTATTTTAGG AATCGGTCGC GCAGTAGGTG AAACCATGGC AGTTATTTTG GTGGCAGGCA 1740 

ACCAGCCGAT TATTCCAAGT GGACTCTTTT CAGGAACCAG AACCTTAACA ACCAATATTG 1800 

TTCTGGAAAT GGCTTACGCA TCAGGTCAGC ATAGGGAAGC CCTTATTGCA ACCTCAGCAG 1860 

TTCTCTTTTT CCTTATTCTC TTGATTAATG CCTACTTTGC C T ACTT G AAA GG AAAAT CAT 1920 

CTTATGAGTA AATACCTGCT AAAACTTCTC GTTTATTGTT TTTCAGCTTT AACCTTTGGC 19 80 

TCTCTCTTTT TAATCATTGG TTTTATCCTC ATCAAAGGCT TACCTCATCT AAGTCTATCC 2 04 0 
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CTCTTTTCTT GG AC T TAT AC TTCTGAGAAC ATTTCCCTTA TGCCAGCGAT TATTTCCACC 2100 

GTTATTCTGG TCTTTGGTGC TCTTCTTTTA GCCTTGCCCA TAGGGATTTT TGCTGGTTTT 2160 

TATCTTGTGG AATATACAAA AAAAG ATT C C CTTTGTGTTA AAATCATGCG ATTGGCCTCA 2 2 20 

GATACCTTAT CTGGGATTCC TTCCATTGTT TTTGGTCTGT TTGGCATGCT CTTCTTTGTA 2 280 

GTCTTCTTAG GTTTTCAATA CTCTCTGTTA TCAGGAATCT TAACCTCAGT TATCATGGTG 2 3 40 

TTGCCAGTCA TTATTCGCTC AACAGAAGAA GCCCTTTTAT CTGTTAGTGA TAGCATGCGT 2400 

CAAGCAAGTT ATGGACTTGG GGCAGGTAAG TTACGGACTG TTTTTAGAAT TGTTCTACCA 2 4 60 

GTTGCCATGC C AG GT ATT T T AGCTGGAGTG ATACTAGCTA TTGGCCGTAT CGTTGGTGAA 2 520 

ACAGCTGCCC TCATGTATAC ATTAGGTACC TCTACCAATA CGCCAAGTAG TCTCATGTCT 2580 

TCAGGCCGTT CTCTAGCCCT ACATATGTAT ATGCTGTCAA GTGAGGGGCT ACATGTCAAT 2 640 

GAAGCCTATG CTACCGGCGT GATTTTGATT ATTACTGTTT TAATGATAAA TACTCTATCA 2 7 00 

AG CTT AT T AT CTCGAAAACT TGTGAAAGGA GCTTCCTAGT ATGGGAACAT TTTCAGTCAG 2 7 60 

ACACCTAGAC TTATTTTACG GGGATTTTCA AGCCTTAAAA AATATTTCGA TTCAATTACC 2 82 0 

AGAAAGACAG ATTACTGCCT TGATAGGCCC ATCTGGTTGT GGCAAATCAA CTTTTCTAAA 28 80 

AACCCTTAAC CGGATGAACG ATTTGGTTCC TTCTTGCCAT ATTGAAGGCC AAGTCCTCTT 2 94 0 

AGATGAGCAA GATATTTATA GTAGCAAATT CAACCTTAAT CAGCTACGTA AGCGTGTAGG 3 000 

GATGGTTTTT CAACAGCCTA ATCCCTTTGC CATGTCTATC TATGATAACG TGGCTTATGG 3 060 

CCCAAGGACA CATGGTATTC GAGACAAAAA ACAATTAGAT GCCTTAGTGG AGAAATCTTT 3120 

AAAAGGGGCA GCCATTTGGG AAGAAGTCAA AGATGATCTT AAAAAGAGTG CCATGTCCTT 3180 

ATCTGGCGGT CAGCAGCAAC GCCTTTGCAT TGCGCGAGCT TTAGCAGTAG AACCTGATAT 3 2 40 

TCTGTTAATG GATGAGCCGA CTTCAGCCTT AGACCCTATC TCCACTTTAA AAATTGAAGA 3 300 

CCTCATTCAG CAACTAAAAA AGG AT T AT AC GATTATCATT GTTACCCATA ACATGCAACA 3 3 60 

AGCTTCACGT ATTTCAGATA AAACTGCTTT TTTCTTAACA GGAGAAATTT GCGAATTTGG 3420 

AGATACCGTT GACGTGTTTA CCAATCCAAA AGATCAGCGC ACAGAAGACT ATATTTCAGG 34 80 

ACGGTTCGGA TAAGGAAGGA AAAACCTATG AGAAATCAAT TTGACTTAGA ATTGCATGAA 3 540 

TTAGAACAAT CCTTTTTAGG AC T AGGGCAA CTTGTCCTTG AAACAGCTTC AAAAGCCTTA 3 600 

CTGGCCTTAG CCTCCAAAGA CAAGGAGATG GCAGAGCTAA TTATCAATAA GGATCATGCT 3 6 60 

ATCAACCAAG GTCAAAGCGC TATCGAATTG ACCTGTGCCC GTTTGTTGGC CTTGCAGCAG 3 720 

CCACAAGTGT CTGACCTTCG ATTTGTGATT AGCATCATGT CTTCTTGTTC AGACCTTGAA 3780 
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CGTATGGGAG ACCATATGGC AGGCATTGCC AAAGCTGTTT TGCAACTAAA AGAAAATCAA 3840 

CTAGCCCCTG ACGAAGAACA GTTACACCAA ATGGGTAAAT TATCCCTCAG CATGCTAGCC 3 900 

GAT T TAT T GG TTGCCTTTCC TTTGCACCAA GCCTCAAAAG CTATTAGTAT TGCTCAAAAA 3 960 

GATGAACAGA TTGACCAATA TTATTATGCC TTATCAAAGG AAATCATTGG AC TT ATG AAA 4 020 

GACCAAGAAA CCTCAATTCC CAATGGAACT CAATACCTTT ATATCATAGG GCATCTGGAA 40 8 0 

CGCTCGCTGA TTACATTGCT AACATTTGTG AACGCCTAGT CTACCTAGAA ACAGGAGAAC 4140 

TAGTGGATTT GAATTAATTC AACTAATCCT TAAAAGAGAA GAGTACGATT AAGTACTCTT 42 00 

TTTTATGGTT GTAAAAAAGT TCATTTGACC AATTTAAGCA G T GT AG AT AG TGAGGAGTTG 42 60 

TTTCAATTCT ATCGTGAACG AGGGAATGCT GAAAACTTTA TCAAAGAAAG GAAAGCAGGA 4320 

TTCTTTGGGG ATAAGACAGA TAGTTCGACC ATGATTAAGA ATGAAGTACG T ATG ATG ATG 43 80 

GGCTGTCTGG CTTATAATCT CTACCTCTTT TTAAAGCAGC TAGCTGGTGA TGAAGTAAAG 4440 

TCCTTGACTA TCAAGCGTTT TCGACGTCTC TTCCTTCATA TTGCCGGAAA ATATGTCTCT 45 00 

ACTGCTAGAC GACATATTCT CAAATTCTCA AGTCTATACG CCTATTCAAA ACAGTTTCAA 4 560 

GCCTTATTTG ATACAATCTG C C AG AT AAAT CTG AT ACT C C CTGTTCCATA TAGAGCTAGA 4 620 

GGGCAGGGGA AAACATGCCT AACAGAATAA GTCACCTTAT TTTAAAAATC GAG C AT C AAA 468 0 

CCAAGGGAGG AGTCTGCCCT TTTTTAGGAA AAAATCAAGA CAAATCTCCT CAATTATGTC 4740 

TCGAACATCA GAAATTAAGC AAAAT C AC C A GAAGGACAGT ATTTCAACTA GCTTTTCTGG 4 800 

TAATTTTTGA ACTGTGTAGT TCGTTAGTGC C AG AT AT G AA TAATTTGGGA TGATAAATCT 4 860 

TTCTTCCTCA GGTAGCCTAT CATAATACTC TTCAAAAATC TTATCAAAAA CACTCTCTTT 4 92 0 

CTTTTGGGCG ATAGTTTCAT CTTCGTATGT AGGAGTCCTC ATCAAGAAAT ACTTCAATTC 49 80 

T AG GT ATT CC TTATCCAACT C T AT AT AACT TGGCATCAAC TTGTAATCTT CAACCCCCAA 504 0 

ACGTTCAGCA ATATATTTTA ACTTTGTTAG TATTGGTCTG GATTCTCCAT TTTCAATTCT 5100 

AATTAATTGA CGGATACTTA AT TC AG ACT C ATCACCACAA AATTCTGAAC GACTGATTT r 5160 

TTTAGCCAAA CGTAATCTTT TAATTTTTTC GCCAAACTCT CGCAACCTAC AAGAACTTCC 5220 

TGAGTTGTTT ACCTCTATTA TAAGCATATA C T G AATC AAA CTATCTATCA GATTTCTTCT 52 80 

CACTTTAACT AAAGACTAAG AGTTTATCCC TTCGTCTCGG TTTTTGTGTA TTTTTCCACC 53 4 0 

ATACCCCAGT AATGCAAGTG CAAAATCCCC TAGAATATGA TAGAATAAGA GAAAGAACTC 5400 

TATCAAGGAG GAAATCATGG AAAAACAAAC CGTCGCCGTC TTGGGGCCTG GTTCTTGGGG 54 60 

AACCGCCCTT TCACAAGTCT TAAATGACAA T GG AC AC GAG GTACGTATTT GGGGAAATCT 5520 

TCCCGAGCAA AT C AATG AAA TTAATACACA CCATACTAAT AAGCACTACT TTAAAGATGT 55 80 
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CGTTCTAGAC G AAAAT AT C A TTGCCTACAC CGACTTAGCA GAAACATTGA AAGATGTGGA 5 640 

TGCGATTTTG TTTGTTGTCC CAACAAAAGT G AC AC G ACTT GTTGCCCAGC AAGTTGCACA 57 00 

AACCTTGGAC CATAAGGTTA T CAT CATG C A CGCATCAAAG GGATTAGAAC CTGATAGCCA 57 60 

TAAACGATTA TCAACCATTC TTGAAGAAGA AATTCCTGAA CATCTCCGTA GTGATATCGT 582 0 

CGTTGTTTCA GGGCCTAGTC ATGCAGAAGA GACCATTGTG CGTGACCTAA CTTTAATAAC 58 8 0 

TGCTGCTTCT AAAGATTTAC AAACAGCTCA ATACGTTCAG AAGCTATTTA GTAATCACTA 5 940 

CTTCCGACTT TATACCAATA CGGATGTTAT CGGGGTTGAA ACTGCTGGTG CTCTTAAAAA 6000 

TATTATTGCT GTCGGTGCTG GAGCTTTACA TGGTCTTGGA TTTGGTGATA ATGCTAAGGC 60 6 0 

AGCCATCATC GCTCGAGGTT TAGCAGAAAT CACCCGCCTA GGGGTAGCAC TCGGGGCCAG 612 0 

TCCATTGACC TATAGCGGCT TATCTGGTGT GGGAGATTTG ATCGTAACGG GAACTTCCAT 6180 

CCACTCTCGT AACTGGAGAG CTGGAGATGC TCTCGGACGA GGAGAATCCC TAGCTGATAT 62 40 

AGAAGCTAAT ATGGGCATGG TAATCGAAGG AATTTCAACG ACTCGAGCAG CCTATGAACT 63 00 

AGCCCAAGAA CTTGGAGTCT AT ATGC C C AT TACACAGGCT ATTTACCAAG TTATTTATCA 63 6 0 

CGGAACCAAT AT C AAAG AT G CCATTTATGA CATCATGAAC AATGAATTTA AAGCAGAAAA 64 20 

TGAGTGGTCT TAACCCTCTA TAGAAAGGAT TTTTATGACA TCAAAAGTTA GAAAGGCAGT 6480 

CATCCCTGCT GC T GG ACT AG G AAC T CG ATT TTTACCAGCA ACCAAGGCCC TTGCCAAAGA 6540 

AATGTTGCCA AT CGT AG AC A AACCAACTAT CCAGTTTATC GTGGAAGAAG CTCTCAAATC 6 600 

AGGTATTGAA GATATTCTAG TTGTCACTGG TAAATCAAAA CGTTCTATTG AGGACCACTT 66 60 

TGATTCAAAC TTCGAATTGG AATATAACCT CAAAGAAAAA GGGAAAACAG ATCTTTTGAA 672 0 

GCTAGTTGAT AAAACAACTG ACATGCGTCT GCATTTTATC CGCCAAACTC ATCCACGCGG 67 80 

TCTCGGAGAT GCTGTTTTGC AAGCCAAGGC TTTCGTCGGA AATGAACCTT TTGTCGTTAT 684 0 

GCTTGGTGAT GACTTGATGG ATATCACAGA CGAAAAGGCT GTTCCACTTA CCAAACAACT 6900 

CATGGATGAC TACGAGCGTA CCCACGCGTC TACTATCGCT GTCATGCCAG TCCCTCATGA 6960 

CGAAGTATCT GCTTACGGGG TTATTGCTCC GCAAGGCGAA GGAAAAGATG GTCTTTACAG 7 020 

TGTTGAAACC TTTGTTGAAA AACCAGCTCC AGAGGACGCT CCTAGCGACC TTGCTATTAT 70 8 0 

CGGACGCTAC CTCCTCACGC CTGAAATTTT TGAGATTCTC GAAAAGCAAG CTCCAGGTGC 7140 

AGGAAATGAA ATTCAGCTGA CAGATGCAAT CGACACCCTC AATAAAACAC AACGTGTATT 7200 

TGCTCGTGAG TTCAAAGGGG CTCGTTACGA TGTCGGAGAC AAGTTTGGCT TCATGAAAAC 72 60 

ATCCATCGAC TACGCCCTCA AACACCCACA AGTCAAAGAT GATTTGAAGA ATTACCTCAT 7320 
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CCAACTTGGA AAAGAATTGA CTGAGAAGGA ATAACAAAAT CATTTATATA AAGATTAGCC 7 380 

ACACATAAAT TAAGTAAATT CTCTACTTGA ATCTACCTAT TTAATAAAAA CTAATGAAAA 744 0 

CGCTATACTT GTATTTGTTT TTTCATTAAA ATAAGAGTAG AATAAATTAG TATAGTAAAA 75 00 

CAAAAAAGCA CCGAATCGGT GCGCACTTTT TCAAGTTGTG TACGGACAAA GCCTTATTTT 7 5 60 

AACTTTGCTA TGTTGTTTCT AATGGTTCCA AAATAATAAA TAATTTTAAA TTTGACTTAA 7 62 0 

CTGTTGGAGT AGTCATGGTT AAATTAAATC AACCGAGCCG AACATAAGTT GTTTAATTTT 7 680 

GTGGAAGCTA TTAATAAAAA TATAATAAGG GAGAAAGATA GGTGTAATTT TAATTTTAAA 774 0 

GTAATTGCGG ACACTATCAA AGAAAAAGAT TATGGAGAAC AAATTTGTAG AATTTATCGA 7 800 

AAACAATAAA AAAGTAATCA TTTCATCAGT TGCAGTTGGT GTTGTATTGG TATTAGGGTT 7 860 

TGGATGGTAT TCATATAACC AACAACAAGC AGAACAACAA GCAAAAATTG TACAATTAGA 7 920 

AAAAGATAGC AAATCAGACA AAGAACAAGT T G AT AAACT A TTTGAATCAT TTGATGCATC 7980 

TTCAGATGAA TCTATTTCTA AATTAAAAGA ACTATCTGAA ACTTCACTTA AAACCGATGC 804 0 

AGGTAAAGAC TATCTTAATA ACAAAGTCAA AGAATCATCT AAAGCAATTG TAGATTTTCA 8100 

TTTGCAAAAA GGTTTGGCTT ATGATGTTAA AGATTCAGAT GACAAATTTA AAGATAAAGC 816 0 

AACTCTTGAA ACAAATGTAA AAGAAATTAC AAAACAAATT GATTTTATCA AAAAAGTTGA 82 20 

TGAAACTTTT AAACAAGAGA ATTTGGAAGA AACTCTTAAA TCTCTAAATG ATCTTGTTGA 82 80 

TAAATATCAA AAACAAATCG AACTTTTGAA GAAAGAAGAA GAAAAAGCTG CTGAAAAAGC 83 40 

TGCTGAAAAA GCAAAGGAAT CTTCTAGTCA AAGTAATTCT TCTGGTAGTG CTTCTAATGA 8400 

GTCTTATAAT GGATCTTCCA ATTCAAATGT AGATTATAGT T CAT CTG AAC AAACTAATGG 84 60 

AT AT T C AAAT AATTATGGCG GTCAAGATTA TTCTGGTTCA GGAGATAGTT CAACAAATGG 8520 

TGGATCATCA GAACAATATT CATCTAGCAA TTCAAACAGC GGAGCAAATA ATGTCTACAG 85 80 

ATATAAAGGC ACTGGTGCTG ACGGCTATCA AAGATACTAC T AC AAAG AT C ATAATAATGG 8 64 0 

AGATGTGTAT GATGACGATG GAAATTACCT TGGGAACTTT GGTGGCGGCA TTGCAGAACC 87 00 

TAGTCAACGC TAATAACTAT TTTAG AG CTG TGTTGTTTCG AATGGTTCCA AAACACATTA 8760 

AAAGCTACTC ATTTTTTAAG TAGCTTTTTT CTTATTCAAG TTTACATATT ATACTCAATG 8 82 0 

AAAATCAAAT TCAAACCACG TCAGCATCGC CTTACCGTAG GTATGGTTAC TGACTTCGTC 8880 

AGTTTCATCT ACAACCTCAA AACCATGTTT TGAGCTGACT TCGTCAGTTC TATCTACAAC 894 0 

CTCAAAGCAG TGCTTTGAGC AACCTGCGGC TAGCTTCCTA GTTTGCTCTT TGATTTTCAT 90 00 

TGAGTATTAG TCGTCACAAT CCCATTCCCT TGTAGAAAAG CAAAATGGCG AGTCCTACGA 9060 

ACAAGACTAC CGCTCCTAAT CTCTGGCTGG TGTTATACAT CCGTTTTTCT CCTCTAACTG 912 0 
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GAAAGATAAC TGCTAGAAAT GCGCCACCAA CTGCACCACC GATATGGCCT GCTAGGCTGA 9180 

TTCCTGGAAT CAGAACACTT CCAATAATGT TAACCACAAA AAGTGTCAGA TAGGATTGCC 92 4 0 

CTAGCTGTTG GATATAAGGA TTGCGAGTTG CAT AG C G AAG AACAATAATC GCGGCAAATA 9300 

GCCCATAAAG AGAGGTAGAG GCGCCTGCTG CTAAGGATTT AGGACTAAAT ACAAAAACAA 93 60 

AGAGATTGCC CAT CAT T C CT GATAAAAGAT AGAGAAAGAA AAACTGCTTA GAACCGAAAA 942 0 

TCTCCTCTAC CTGCCTTCCA AGATAATAAA GTGAAAGCAT ATTAACAATG AAATGTTCCC 9480 

ACCCAATATG AACAAAAATG GCAGACAAGA GACGCCAAAC CTGCTCGGGA AAGAGGCGAA 954 0 

TAGCTGGCCC ATACATGGCT CCAAATCGAA ATAATGTATC TGCCCTGTCA AAGTTTCCGC 9600 

CTGCAGTGAC CAACATTAGT AAAAAT AC C A AGGCCGTCAC T AAG AG G AAG AAACTCGTCA 9 6 60 

CAGGGTAACG TCTATCAAAG ATTTCCTTCA TCAATTAATA CCTCCTGAAC AGGAATATCA 972 0 

TGGTTTTCAG GTATAAAGTC CTGAATTTGA CAAGGATATA TCGTACTCAA AGTACGACCA 9780 

GAAAAATGTT CCAGATAGCG GTCATAATAG CCTCCACCGT ATCCTATCCG ATATCCTTTC 9 84 0 

GTCGTAAAAG CCAGACCAGG AACATGAATC AAATCAATCT GAGATGCATC CACCACTTCC 9900 

AAATCTCCCT GTAGCTCCAG TAAGGCAAAG AAAGTTTTTA CCAACTGTTG CGGATCATAG 9960 

AC C AC AAAGT CCATGCGCCC CTTGGGATAA GTTTTGGGTA TTAAAACCTT CTTGCCGTCC 1002 0 

TTCAGCGCCT GCTCAATCAG TTCCTGCGTT TGAAACTCAT GAGAAAAAGA GAGGTAGGTT 10 080 

GCGATGACCT TGGCTTCTTG ATAAAAGGGG TGTTGTAAAA GCCGCTCGGT TAAAGCTTGG 1014 0 

TCTATAGCCT GTTTTTGCTC TTGAGATATA GCCTTCATTT CATGCAAGAC TTGCTTGCGT 10200 

AATTCCGATT TCATAGACAA GCCCTCTATT CTGCTGCCTT CTTTTTCAGG AAACTAGACA 10260 

CCGCAGCCAC CCCAATAGCT AAGACTTCTT CCTTAGGACT CATTTGAGGG TGATGAAGAG 10320 

CGTAGGGACT ATCGATACCT AGCCAAAACA TCACGCCATC AACCTTTGAA AGGAGATAAC 103 8 0 

CAAAGTCCTC GCCTGTCATA GCAGGTTCGA TATCAATCAA CTCGATTCCG TCTTTTTCGT 1044 0 

CAAAGAAGTC CATCAGTTCA CGCGCCAAGG CTGGATTGTT CTCAACAGGT AGGTATCCAC 10500 

CTTGTTTGAG TTCCACTTCG ACTTCCATAT CAAAGGCAGC TGCAACCCCT TCTGCAACTG 10560 

TTTTTACCCT CTTTTGCACC AAG AG ACT C A TGTCCTGTGT CAAGGCACGA ATAGTTCCAT 10 62 0 

GTAAAAAAGC TGTGTCTGTG ATGACATTGT TGGTGGTTCC AGCTTGAAAA ACGCCGAAGG 10 680 

TCACCACTGC TCCCTCGATT GGGTTGACAT TGCGGCTAAC AACTGACTGC ACTTGGGTCA 10740 

CAAAGTAACT AGCCGCCACC AAGGCGTCAT TGGCTTCATG AGGAAAAGCT GCGTGGCCAC 10800 

CTTTGCCTTT GAAACGGATC TTCACCTCGC AAGTTCCTGC AAAGAGTGTA TGAGTATTAG 10860 
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TCGCAATCTG GCCGACTTTC AAATCTGGAC GAACATGGAG AC C AT AG AAT TGATCTGGCA 10920 

ACCAATCTCC AAAAGCACCG TCCTCATACA T GAG CAT AC C ACCAGCTTCA TTTTCTTCAG 109 80 

CAGGCTGAAA TAGAAAGAGC AG AT TAT T C T TGGGTTGCTC CTCAAGGGCG CGCTCAAGAC 11040 

AGCCTAAGGC AATGGTCATA TGAAAATCAT GG AC AC AG G C ATGCATGCGA CCTTGGTGTT 11100 

GAGAAGCAAA AGGTAGACCT GTTTGTTCGA CGATAGGCAG G C CAT C AAT A TCTGTCCGCC 11160 

AAC C AATGGT TCGCTCCGGC TGACTTCCCT GCAGGTAGAC CAAAATCCCT GTCCGCCAAG 1122 0 

TACGAATTTG AACAAAATCC TTGCCCGTAG TCAATTTCTC AAT C AC AT C C AGCAAATAAG 112 8 0 

CCTGAGTCTT GAACTCCTCC AAGCCAATCT CTGGAATCTG GTGTAAATCT CGTCTAGTCT 113 4 0 

GAATCAAATC TAACATCTAT CTGTCCTCCG ATATAGCAGA AAGAGGCTGG AAAAAGGGTT 114 00 

CCGCCTCTTT TTTACTTTTA CAATTACAAG GTACGAAGCG CATCCTCTAG CGCTGTTTTT 114 60 

TGTTGAGTTT GGGCATCAAT TTCTTTGATA ATACGAGCTG GAACACCTGC TACTACCACG 11520 

TTTTCTGGGA CATCTTGGGT AAC AAT AG CT CCTGCTGCGA CAACTGAACC ACTACCGATT 115 8 0 

TGGACTCCTT CGATAACCAC TGCATTAGCA CCGATAAGAA CATTGTCTCC GACACGGACT 11640 

GGTTCAGCAC TAGCTGGCTC AATCACACCT GCCAAAACTG CACCTGCACC AACGTGGCTA 117 00 

TTTTTTCCAA CGATGGCACG GCCACCAAGG ATGGCACCCA TG T C AAT CAT GGTTCCAGCA 117 60 

CCGATTTCAG CACCGATATT GATAACAGAT CCCATCATGA TAACAGCATT GTCACCAATT 11820 

TCCACCTGGT CACGGATAAT CGCACCTGGC TCGATACGAG CGTTGATAGC ACGCTTATCT 118 80 

AGCAAAGGAA CTGC AG AAT T ACGAGCATCT TGCTCGACAA CATAATCTTG ATTTTCTACC 11940 

AAACCTTCAA GAAGCGGAGC CACATCCTTC CAGTCTCCGA ATAGGACATT TCCTAGTTTG 12000 

ACAACAGAGC TAGGCACAGC AGTTGCGAGT TGCCCCTCAA AGGTTACTTT GACACTGGTT 12060 

TTCTTTTCAG CATTGGCGAT AAATTGGATA ATTTCTTGAG CGTTCATTTT TGTAGCAGTC 1212 0 

ATAGGTG 1212 7 
(2) INFORMATION FOR SEQ ID NO: 14 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 9: 

CCATCCTTCT GTTGATGTGA CAGGAATGAT GATAAATCAA CCAGTAGCTA GTCGCGAAGA 6 0 

GGTGACAGAG GCTTTGAGTC ACTTGGCGGT AGAGCACAAT AGTCTCATTG CTCGTCGAAT 12 0 



